Web Hosting Talk







View Full Version : Advice on moving 98 GB of data?


Fahd
03-05-2004, 02:46 AM
How would you recommend moving 98 GB of data across data centers from a remote location?

tar+wget, rsync, scp???

Any pitfalls to watch out for?

Thanks! :)

net-trend
03-05-2004, 02:54 AM
Recommendation, do it in blocks. :)

Fahd
03-05-2004, 03:00 AM
Thats rather cumbersome to do since directory structure is like:

/Primary Folder
/Primary Folder/0123456789ABCDEF (16 random digit directory names, a few thousand of them actually!)
/Primary Folder/0123456789ABCDEF/0123456789ABCDEF.file (16 random digit file names, again a few thousand of them though extensions are the same!)

justbenice
03-05-2004, 03:12 AM
i would spilit a file to smaller , about 4,5G each and sftp . :) . I'v move a file 18G before and i spilit it to 2 file which 9 G each.

TheVoice
03-05-2004, 03:20 AM
scp has proven to be reliable for me. I suggest setting up a screen session for the scp and then detach it. This way the scp session won't mess up if you lose your connection to the server.

makesecure
03-05-2004, 04:03 AM
Just adding few more ideas to think about.....

Is this data something no-one can see? Although SCP provides security and TCP for better recovery procedures, if your network connection between the two servers is GREAT and you do NOT worry about security, tar+gzip the file and FTP it via UDP to offload some of that extra packet overhead on each MTU.

I like the idea of spliting up the files, just in case your transfer dies in the very end. These "random" directories you have, can you do some "df" searches to find out which take up the most space and then find a combination on how to tar only the ones starting with 1, or 2 or 3 ...etc?

You probably want to check your peak/off-peak hours and use those times to transfer your files...

Try a dry run with 1gig file... play a little ..

cheers

Fahd
03-05-2004, 04:16 AM
Originally posted by makesecure
...These "random" directories you have, can you do some "df" searches to find out which take up the most space and then find a combination on how to tar only the ones starting with 1, or 2 or 3 ...etc?...

Yes, and that would bring me down to 36 BIG files to transfer.

Correction: All directories and filenames are 32 characters long and not 16 digits as noted above! :o

WCHost
03-05-2004, 04:38 AM
buy the harddrive from the host.
I guess it will be easier~

Now I have to type longer because WHT won't let me write another message in another 90 seconds!

thedavid
03-05-2004, 04:43 AM
Wow, so they're *all* in /primary folder/?

I'd give rsync a try, run it in the background and check on it occasionally. If there's a ton of files though, the first step in it (building the file list) is going to take ages.

Failing that - buy the hard drive. ;)

Fahd
03-05-2004, 04:58 AM
Yes, they are all in /primary folder/

If only I could just buy the HDD, unfortunately some companies wont easily sell you the HDD and/or without HUGE markups! (And no guessing the provider will not get you any rewards! ;) )

msh
03-05-2004, 01:16 PM
tar and gzip the file and the ftp it. It has worked fine for me. Alternatively you can split the tar file in several gzip files.

Oopsz
03-05-2004, 01:45 PM
Rsync it over ssh. Use archival settings and compression. Easy, clean, and secure.

zagadka
03-05-2004, 03:16 PM
Tar it then

you may split if you donot want to use scp other way

only is scp