us to any pages on the internet that tell how to setup the indexes and any
other parameters needed to get this to work?
If it's easy to setup or can be scripted, it might be a great way to get
BackupPC_nightly to run a lot faster.
Thanks,
Leon
-----Original Message-----
From: Yaroslav Halchenko [mailto:yoh < at > psychology.rutgers.edu]
Sent: Friday, August 06, 2004 8:31 AM
To: Craig Barratt
Cc: backuppc-users < at > lists.sourceforge.net
Subject: Re: [BackupPC-users] locating 'existing' files... or just how to
report all hardlinked files
On Thu, Aug 05, 2004 at 11:12:55PM -0700, Craig Barratt wrote:
Unfortunately the current pool/hardlink structure makes it very
expensive to find all files that have the same inode
(ie: identical files). I don't know of a faster approach than finding
the inode of the file and then searching the entire pc directory tree
for that inode:
ls -li /data/BackupPC/pc/HOST/123/fx/fy/fz
find /data/BackupPC/pc -inum 123456789 -print
where 123456789 is the inode number displayed by ls -li.
That's pretty expensive.
Yes - that is expensive and that is why I suggested to use 'locate'
approach - index all such finds and using existing locate program + wrapper
do such finds in 1-7 seconds.I've tested on our pool repository of 200GB
from 8 nodes with around 5 backups each...works like a charm
backuppc < at > sink:~$ time ./ilocate cpool/3/b/f/3bf2ac4a1407590a535b63cce0935185
5017508 cpool/3/b/f/3bf2ac4a1407590a535b63cce0935185
5017508 pc/ravana/10/f%2fraid/fhome/fmatsuka/fmatwork/fs2all.mat
5017508 pc/ravana/10/f%2fraid/fresearch/fhaxby/fhaxbydata/fs2all.mat
5017508 pc/ravana/3/f%2fraid/fhome/fmatsuka/fmatwork/fs2all.mat
5017508 pc/ravana/3/f%2fraid/fresearch/fhaxby/fhaxbydata/fs2all.mat
real 0m5.835s
user 0m2.697s
sys 0m0.068s
The first number is inode number... And now I know that the guy probably
copied that huge file from /research to his home directory...
--
Yaroslav Halchenko
Research Assistant, Psychology Department, Rutgers
Office (973) 353-5440 x263
Ph.D. Student CS Dept. NJIT
Key http://www.onerussian.com/gpg-yoh.asc
GPG fingerprint 3BB6 E124 0643 A615 6F00 6854 8D11 4563 75C0 24C8
-------------------------------------------------------
This SF.Net email is sponsored by OSTG. Have you noticed the changes on
Linux.com, ITManagersJournal and NewsForge in the past few weeks? Now, one
more big change to announce. We are now OSTG- Open Source Technology Group.
Come see the changes on the new OSTG site. www.ostg.com
_______________________________________________
BackupPC-users mailing list
BackupPC-users < at > lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/backuppc-users
http://backuppc.sourceforge.net/
