IntroductionThis is part of a series of articles on backing up computers. The top page is Design for an Archiving Backup System.
Experiment 1Write a program to tally the MD5 digests across a set of machines via the administrative shares. Examine the results to determine:
filehash.txtThis contains the chunk hash data, each line is of the format:
fileNumber blockNumber BID
filename.txtThis contains the file object data, each line is of the format:
driveNumber fileNumber fileSize fileUNCName
Some Estimation StatisticsOn FLARE (a Windows 2K box) there are about 1591MB in 18364 system files and installed applications, for about 86K/file. Plus there is 5480MB of user data in 3934 files, for about 1400K/file.
On GALAXY (a Windows XP box) there is about 3867MB in 36650 files (system and apps), for about 105K/file.
On NOVA (a Windows 4.0 box) there is about 2476MB in 24943 files (system and apps), for about 100K/file, but on the data drives 5141MB in 47909 files for 107K/file and 138170MB in 84778 files for 1630M/file on the network media drive (largely audio and digital photographs, but some video).
So figure on 50-100K files per machine in a network, and each file typically would be about 100K in size, giving about 5-10GB per machine. For the case of a file server that is holding a central repository of media files the number of files probably will not be vastly larger, but the average size of each file may be much larger (16 times in the above case).