Main File Server status
Predrag Punosevac
predragp at andrew.cmu.edu
Tue Apr 3 19:49:43 EDT 2018
Predrag Punosevac <predragp at andrew.cmu.edu> wrote:
> Dear Autonians,
>
> As I indicated earlier in my e-mails I no longer can hold rebuilding the
> main file server. In my tests remote replications appear to be
> consistent.
>
> I have just mounted backup copies of /zfsauton/project and
> /zfsauton/data folders from my backups on gpu9 and lov1 computing nodes.
backup copies which are physically located on Uranus of
/zfsauton/project and /zfsauton/data ZFS datasets are now mounted/live
to all computing nodes but not on desktops. The snapshots are taken as
usual but no remote replication is currently performed.
If everything looks OK in next 24h I will destroy the copies of these
two datasets on the main file server in order to create enough space to
be able to snapshot and backup /zfsauton/home dataset which was not the
case for the past few weeks. This is how the ZFS pool looks right now
[root at gaia] ~# zpool list
NAME SIZE ALLOC FREE CAP DEDUP HEALTH ALTROOT
zfsauton 27.2T 24.6T 2.61T 90% 1.00x ONLINE /mnt
I have to drop load to under 80% in order to be able to backup your home
directories before rebuilding the server.
Predrag
> If you have few minutes to spear please log into these two servers and
> check for yourself if everything passes the smell test. Please let me
> know ASAP if everything is OK.
>
> Unless we discover something unexpected I will stop snapshots and remote
> replications of those two data sets on the main file server at 4:00 PM
> shortly afterward start umounting those shares and mounting them from
> the backup copy. If you have important experiments which uses data in
> those directories you need to speak right now. All processes have to be
> stopped for me to umount existing shares and mount new shares. I
> appreciate your cooperation in this matter.
>
> Best,
> Predrag
More information about the Autonlab-users
mailing list