Nevis Linux Cluster Disk Guide This is a guide to issues with disk storage on the on the Linux cluster. To find out how disks on one system can be access from another, be sure to read the automount page.

Topics on this page include:

  • /home versus /data partitions (or "how much disk space do I have?");
  • sorkstations versus server versus file server (or "where should your home directory go?");
  • backup policy;
  • long-term data storage (RAID arrays, /data directorys, and large data files).


How much disk space do I have?

To find out how much disk space you have available, use the df command. You'll probably always want to use the -h option, so the sizes appear in human-readable form:

df -h You'll almost certainly see disks in the list that are mounted via automount. Remember, such disks are not part of your system, and you should never attempt to visit the .automount directory directly; it's a fiction. If you find the automounted disks to be distracting, add -l to the command: df -hl

Here's the result of executing df -hl on the machine tanya on 28-Jan-2004:

Filesystem Size Used Avail Use% Mounted on /dev/hda6 487M 161M 301M 35% / /dev/hda1 46M 9.2M 35M 22% /boot /dev/hda7 62G 32G 27G 55% /data /dev/hda2 5.8G 2.3G 3.2G 42% /home none 504M 0 504M 0% /dev/shm /dev/hda3 3.9G 2.9G 842M 78% /usr AFS 8.6G 0 8.6G 0% /afs If we ignore AFS and the partitions that relate to the operating system, we're left with two key filesystems: /home and /data. (Some systems also have a /work partition; the same rules apply to /work as to /data.)

On most of the systems in the cluster, /home is smaller than /data, sometimes much smaller; it's usually 4-6 GB for workstations and 15-35 GB for servers. It's intended that the /home be used for "source" files (program code, scientific papers, mail, etc.); /data should be used for large and re-creatable files (compiled binaries, data summaries, temporary work files, etc.).

These partitions also have different permissions set. Only a systems administrator can create a new directory in the /home partition; typically, such directories are for user's home directories. Anyone in the same group as the /data can create a file or directory there -- but after that, only the person who created the file/directory can delete it (the "sticky" bit is set).

An aside: Although you can do so, never turn on group-write permission on your home directory. You'll interfere with both ssh and mail (your ~/.forward file won't be recognized).

If you're just skimming this page, stop and read this paragraph:

One final difference between /home and /data: the /home partition is backed up; /data is not. In fact, it goes one step further: the /data partition is always considered expendable for any type of system maintenance activity; if a system is being repaired, upgraded, or restored, the /data partition will probably be erased. (There's some advice on what do about this below.)

What do I do if I need more disk space?

First, look to /data (or perhaps /home) partitions on other systems. If you use a workstation, the disk space on the server is always available to you. The /data partitions on all the systems that belong to a group are intended to be a shared resource; if you don't have enough space on /a/data/yourmachine, cd /a/data/othermachine in your group and see how much free space it has.

I strongly advise you to exercise common courtesy as you're scrounging for disk space. If I found someone had used a big chunk of my workstation's /data partition without asking, I might be annoyed. Also, be aware that a workstation user can reboot that system at any time; don't assume 100% availability of files stored on another user's workstation.

If you still don't have enough disk space on all your group's machines to satisfy your needs, you may have to request more disks be added to the existing systems (or buy a new box).


Where should my home directory go?

In a UNIX-based system, the home directory is your primary workspace. Your personal configuration files go here, your most important files will probably be stored here, and your home directory is key in the processing and storage of your e-mail.

When a systems administrator creates an account on the Nevis Linux cluster, he has a choice on where to place your home directory:

  1. Your desktop workstation.

    For users who have a system on their desk, this is usually the best solution. They don't usually have to share their /home partition with other users, and so have the freedom to manage their disk space. Also, although you can submit jobs on any system on which you can login, it's easier to submit jobs on your own workstation.

    The disadvantage of this option is that workstations are usually less robust systems than the servers. Their disk drives, at least in theory, may not last as long as those in servers (although this has not been the case with newer workstations in recent years). Also, after a power outage, the servers will usually reboot automatically while a workstation may not -- or a workstation may boot faster than the servers, and not have access to some of the services.

    (Before you're discouraged from this option, it may help to know that this is the option I choose.)

  2. Your workgroup's server.

    If you don't have a system on your desk, this will probably be the option that the systems administrator will choose. But even users with a desktop workstation have chosen this option. Typically, the issue is whether the desktop system reliably reboots after a power outage.

    However, this option has its disadvantages. You have to share the /home partition with other users. Also, we have the speed paradox: because workstations are purchased more frequently than servers, the processor speed of a workstation is often faster than that of the workgroup's server. (The servers have other speed advantages, though: faster hard drives and dual processors.)

  3. The archive file server.

    Generally, this option is reserved for those users who were never part of any research group at Nevis (e.g., the administrative staff), or who never made use of the Linux cluster.

Incidentally, home directories can be moved; if you're unhappy with its location, you can ask for a change.


How often are backups made?

The Nevis Linux cluster is normally backed up nightly onto shelley, the Nevis backup server. This includes the systems at the Nevis Annex. The Windows systems at Nevis are also backed up by this server.

Actually, we don't copy every file from every system; we use a program called rsync to copy over only those files that have changed since the day before.

While the entirety of the mail and archive directories (on franklin and archive) are backed up, we don't back up every file on every system on the cluster. The policy is: the /home partition is backed up; /data is not. There is a web page that contains the list of which partitions are backed up.

We maintain previous versions of old files on shelley. (Actually we do an incremental tar of the disk images after the rsync procedure has run for all the machines in the cluster.) This means we can recover old versions of files if necessary. However, there's a time limit: we only keep old file versions for three months. We cannot recover files that were deleted or overwritten prior to that.

At this point we usually get these questions:

  1. Why don't you back up /data partitions?

    We have vastly more disk storage on the Nevis cluster than we can hope to back up on any system that we can afford. As of 11-Dec-2006, of roughly 12TB of disk storage on the cluster, we back up ~1TB.

    We therefore have to ask users to segregrate their files into key files that will be backed up, and re-creatable files that won't. The relative sizes of /home versus /data partitions help enforce this segregation.

  2. I've got files in a /data partition that would be a pain to re-create. How can I back them up myself?

    The simplest thing to do is to make copies on other /data partitions in your workgroup's cluster. After all, to first order, that's all a backup is: a second copy of your files.

    Another possibility (if the files are small enough) is to consider backing them up on CDs or DVDs, assuming that your workstation has a burner. For Linux systems, I recommend k3b, which is installed on most of the systems of the Nevis cluster.

  3. Why only three months worth of backups and versions? Why not a year?

    We're doing what we can with the resources we have available. We don't have the disk space for a year's worth of backups.

  4. I've got critical files that I want backed up even more often. What can I do?

    You can supplement our backups with copies of your own. For example, I have my own private backup procedure for my critical source files. The procedure makes use of the rsync command; you can see it in ~seligman/bin/rsync.sh on the Linux cluster.

    I run this script automatically a few times per day using cron. Here's a sample line from my crontab file:

    10 */6 * * * /a/home/tanya/seligman/bin/rsync.sh This translates to: Every six hours, at ten minutes past the hour, run my script.

    As you look over my files, notice that I copy a subset of my home directory onto a data disk on another machine. Abusing this facility by backing gigabytes of files in your home directory onto someone else's system will probably get you yelled at.


Long-term data storage

For the purposes of this section, "long-term" means more than six months or so.

By the above definition, there is no long-term data storage at Nevis. As noted above:

If you need long-term storage for any of your files, I suggest you consider the facilities at BNL, FNAL, or CERN.

Once again, I offer the advice I gave above: If you want to assure that files stored in a /data directory will be preserved, make a copy of them on a second system's /data directory.


Back to the Nevis Linux cluster Page.

Return to the Nevis Computing Page.

Up to the Nevis Home Page.

E-mail: Send any comments or questions to the webmaster.