Main File Server status

Predrag Punosevac predragp at andrew.cmu.edu
Tue Apr 3 19:49:43 EDT 2018


Predrag Punosevac <predragp at andrew.cmu.edu> wrote:

> Dear Autonians,
> 
> As I indicated earlier in my e-mails I no longer can hold rebuilding the
> main file server. In my tests remote replications appear to be
> consistent.
> 
> I have just mounted backup copies of /zfsauton/project and
> /zfsauton/data folders from my backups on gpu9 and lov1 computing nodes.

backup copies which are physically located on Uranus of
/zfsauton/project and /zfsauton/data ZFS datasets are now mounted/live
to all computing nodes but not on desktops. The snapshots are taken as
usual but no remote replication is currently performed.

If everything looks OK in next 24h I will destroy the copies of these
two datasets on the main file server in order to create enough space to
be able to snapshot and backup /zfsauton/home dataset which was not the
case for the past few weeks. This is how the ZFS pool looks right now

[root at gaia] ~# zpool list
NAME       SIZE  ALLOC   FREE    CAP  DEDUP  HEALTH  ALTROOT
zfsauton  27.2T  24.6T  2.61T    90%  1.00x  ONLINE  /mnt

I have to drop load to under 80% in order to be able to backup your home
directories before rebuilding the server.

Predrag



> If you have few minutes to spear please log into these two servers and
> check for yourself if everything passes the smell test. Please let me
> know ASAP if everything is OK. 
> 
> Unless we discover something unexpected I will stop snapshots and remote
> replications of those two data sets on the main file server at 4:00 PM
> shortly afterward start umounting those shares and mounting them from
> the backup copy. If you have important experiments which uses data in
> those directories you need to speak right now. All processes have to be
> stopped for me to umount existing shares and mount new shares. I
> appreciate your cooperation in this matter.
> 
> Best,
> Predrag


More information about the Autonlab-users mailing list