The files you see from the login node are typically shared among all nodes in the cluster. But use this login machine only for compiling and general tasks. For running parallel jobs, use the queuing system discussed below to access the computer cluster. To copy a dataset from csil.cs.ucsb,edu, allocate one node and use scp in that node to copy data from csil.cs.ucsb.edu.
Every class user is allocated with a certain amount of service units. Use the gbalance -u username command to check the amount of time in your account.
Your account is billed according to the formula:
#CPUs x #nodes x wall-clock-time.SU charge is rounded to the nearest CPU node hour. For example, if you run with 8 CPUs (cores) on 1 node for 12 minutes, your account will be charged 2 SUs.
It is easy to accidentally chew through your time allocation by running a program with dead lock or leaving an interactive session open. One thing you can do is to always specify a maximum time the job can execute. For example, 10-20 minutes.
qsub -I -l nodes=2:ppn=1 -l walltime=00:15:00to start an interactive job on two nodes (each with 1 core) for at most 15 minutes.
Once you run it, it will wait until a node is available, and then it will create an SSH session to that node. Your job will last until you exit that SSH session.
Caveats: interactive jobs will be scheduled just like script-based jobs, so they'll take as long to execute as the queue is. Right now, this doesn't seem to be a problem, so running with the batch queue is fine. If the queue is slow, remember to stop it (control-C works) so that you don't waste a bunch of time doing nothing.
qsub job-script-filewill submit your job to the scheduler and assign it a job number. Be careful when you are submitting multiple jobs. The output files will get overwritten by subsequent or concurrent jobs writing to the same file.
qdel jobnumber
where jobnumber was assigned by qsub.