Performing a Linux workstation capture

Use the following procedures to capture data using a Linux workstation. See Data capture for more information on data capture options.


You need to know the following information in order to capture data from a Linux workstation:

In addition, make sure that:

Installing the Capture Utility on a Linux workstation

Use the following procedure to install the Capture Utility on a Linux workstation.

  1. On a workstation, connect to the Transfer Appliance web interface.

  2. Select Download Linux Capture utility in the Data Capture menu to download the Capture Utility installer. The Capture Utility contains configuration information specific to the Transfer Appliance from which it was downloaded. Only download the Capture Utility from the appropriate Transfer Appliance.

  3. Extract the downloaded archive.

    This creates the CaptureUtility directory containing the installer script,

    tar -xvzf CaptureUtility.tar.gz
  4. Run the in the CaptureUtility directory:

       cd CaptureUtility

    The Capture Utility is installed in the same CaptureUtility directory .

Testing connectivity to the Transfer Appliance

Before running your first data capture job, confirm that the workstation can connect to the Transfer Appliance.

  1. On the Linux workstation, open a terminal window.

  2. Test connectivity with the Transfer Appliance by running with the -t option.

     ./ -t
  3. You are prompted to enter the capture user password.

     Enter Transfer Appliance <Transfer Appliance IP address> "cuser" password:

    The utility tests connectivity between the workstation and Transfer Appliance and returns results similar to the following:

    Test: Ping Transfer Appliance (
    Test: SCP to Transfer Appliance (
    Test: Control connection with Transfer Appliance (
    Test: Return connection from Transfer Appliance to this workstation, (could take several minutes)...OK

    If the connectivity test fails, refer to the error message returned to determine the cause. The most common reason for connectivity test failure is a firewall blocking the ports needed for data capture. See Preparing the Network for details on which ports need to be open.

Performing a workstation capture from a Linux workstation

Before you start a Linux workstation capture, make sure you have the following permissions to your data source:

  • Read and execute permissions for folders.
  • Read permissions for files.

If you are only capturing data that you own, you already have the required file and folder permissions. If you want to capture data owned by other people, contact your IT administrator and ask for an account that grants you access to the data. If you are in IT administrator in charge of moving other people's data, use a backup operator or service account that has read access to all of the data.

After checking connectivity, use the Capture Utility to start a data capture job.

  1. Run and specify a job name and capture target:


    where [JOB NAME] is the name of the data capture job and [CAPTURE DIRECTORY] is the directory that contains the data to capture. The Capture Utility recursively captures all data in the directories under the one specified.

    Use a meaningful job name. A job name is used to identify a capture job and the files it contains for the rest of the data migration project. A job name can contain alphanumeric, underscore, and hyphen characters.

  2. When prompted, enter the capture user password provided to you by Google:

    Enter Transfer Appliance <Transfer Appliance IP address> "cuser" password:

    The capture job runs and displays a completion message when it is finished. Leave the terminal window open while the capture job is running, or the job will be terminated. It takes up to ten minutes for the Transfer Appliance Web interface to display a "Failed" status.

    For example, the following command creates a job named data-capture to capture the data in the directory /mnt/data and its subdirectories.

    ./ data-capture /mnt/data

Specifying data paths for capture

You can specify data paths for capture in one of two ways:

  • If you want to capture a single directory or file, provide a single directory when running a capture job. This method captures data from the specified directory and its subdirectories.
  • To capture multiple target files and directories, run with the -f option. You must provide a line delimited text file containing absolute paths to the files you want to capture.

The following is an example of the contents of the text file you must provide when you use the -f option.


The following command creates a job dataFactory that captures data from the above targets specified in /home/user/filespec.txt.

./ dataFactory -f /home/user/filespec.txt

For more information about Capture Utility options, see Capture utility reference.

To direct the Capture Utility to skip symbolic links to files, use the -e option with For example, the following command creates a job named data-capture to capture the data in the directory /mnt/data (including its subdirectories), but skips symbolic links to files.

./ data-capture /mnt/data -e

For more information about the options for the Capture Utility, see Capture utility reference.

Capturing file metadata

The Capture Utility ( preserves the following file metadata by default:

  • The user ID of the owner.
  • The group ID of the owning group.
  • The file permissions.
  • The last modification time of the file.
  • The last file access time.

For example, the following command creates a job named data-capture to capture the data and file metadata in the directory /mnt/data and all subdirectories.

C:\Program Files\TA> data-capture /mnt/data1

For more information about the options for the Capture Utility, see Capture utility reference.

Next steps

To perform a parallel data capture task, follow these instructions on a separate workstation. You can also perform a Microsoft Windows workstation capture using the same Transfer Appliance.

If your data size exceeds the capacity of a single Transfer Appliance, capture your data using multiple appliances in succession.

To retry a transfer job, see Retrying unsuccessful data capture jobs.

To cancel a transfer job, see Canceling transfer jobs.

To monitor:

If you are done capturing data, see Preparing and shipping an appliance.