From jmjoseph at andrew.cmu.edu Tue Dec 7 14:25:55 2004 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Tue, 07 Dec 2004 14:25:55 -0500 Subject: [auton-users] Reservations/Condor Testing Message-ID: <41B603C3.5050400@andrew.cmu.edu> To further test the Condor job batch submission system, mentioned last week, I have reserved 3 more lops, for a total of: lop2, 5, 6, 7, loq1, and loq3. Lop1, 3, 4, loq2, and lor1 are still available for general use. If you've got jobs to run, I'd appreciate wider testing at this point. Jobs may be submitted using lop1, with the following directions. 1. Create a spec file to tell Condor how to configure and run your job. It is of the form: -------------------------------------- # configure the job Executable = Arguments = foo1 foo2 foo3 Image_Size = 1300 Meg Universe = vanilla Error = condor.err.$(Process) Output = condor.output.$(Process) Log = condor.log.$(Process) # submits the job Queue -------------------------------------- Everything but 'Executable', 'Universe', and 'Queue' are optional, but the 'Image_Size' is very important for matching with an appropriate machine to run on. The Error, Output, and Log are the respective outputs and '$(Process)' is just a number of the run. If you were to queue multiple runs with a number after 'Queue', the process would increment for each one. It is important that your job require nothing other than the Arguments here. That is, it's not going to have any of your standard environment variables, path or any such thing. I do recommend absolute paths if in doubt. You certainly can get around this by using a script with the appropriate environment as your executable. If you're running java code, this means you'll have to set the CLASSPATH as a java argument. Do note that you may also use "Getenv = True" to submit the job with all of your current environment variables. 2. Submit the spec file using the 'condor_submit ' command. You'll then see it added to the queue, visible with 'condor_q' and subsequently matched to an available machine. You may observe a short delay before the job executes. The status of the cluster machines may be observed with 'condor_status'. Of course, each of these commands has any number of options, but hopefully this will get you started. Do note that in order to allow single jobs to utilize all memory in a machine, 'condor_status' will report an extra vm (CPU) in each machine. Don't believe it. Each is still a dual CPU and 2 jobs will be run provided they each use less than half the available memory. There is a comprehensive user manual available at: http://www.cs.wisc.edu/condor/manual/v6.7/2_Users_Manual.html Thanks, -Jacob From adubr at yahoo.com Tue Dec 7 18:59:11 2004 From: adubr at yahoo.com (Artur Dubrawski) Date: Tue, 7 Dec 2004 15:59:11 -0800 (PST) Subject: [auton-users] Big Thanks! Message-ID: <20041207235912.69120.qmail@web20105.mail.yahoo.com> Dear Friends, Thank you very much for the basket which arrived here in the afternoon and which made such a nice surprise. Keep sending them baskets and I will stay here longer :-) Hope to see you all soon. Artur __________________________________ Do you Yahoo!? Yahoo! Mail - Find what you need with new enhanced search. http://info.mail.yahoo.com/mail_250 From dongryel at andrew.cmu.edu Fri Dec 10 17:50:07 2004 From: dongryel at andrew.cmu.edu (Dongryeol Lee) Date: Fri, 10 Dec 2004 17:50:07 -0500 (EST) Subject: [auton-users] Lor1 reservation request Message-ID: Hi, I need to reserve Lor1 for experiments. I think I will be done by next Friday. Dongryeol Lee Computer Science/Mathematical Sciences Senior 05' From jmjoseph at andrew.cmu.edu Sat Dec 11 11:59:07 2004 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Sat, 11 Dec 2004 11:59:07 -0500 Subject: [auton-users] Opteron reservations Message-ID: <41BB275B.10409@andrew.cmu.edu> Lor1 has been reserved for Dongryeol through next Friday. This leaves lop1, lop3, lop4, and loq2 available for general usage. 12 CPUs are available for use through Condor. -Jacob From dongryel at andrew.cmu.edu Mon Dec 20 11:29:51 2004 From: dongryel at andrew.cmu.edu (Dongryeol Lee) Date: Mon, 20 Dec 2004 11:29:51 -0500 (EST) Subject: [auton-users] Opteron reservations In-Reply-To: <41BB275B.10409@andrew.cmu.edu> References: <41BB275B.10409@andrew.cmu.edu> Message-ID: Hi, Lor1 is now available for use. Dongryeol Lee Computer Science/Mathematical Sciences Senior 05' From jmjoseph at andrew.cmu.edu Tue Dec 21 16:01:08 2004 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Tue, 21 Dec 2004 16:01:08 -0500 Subject: [auton-users] Cluster firewall Message-ID: <41C88F14.6040308@andrew.cmu.edu> Hi. Due to some suspicious network traffic going out of the cluster machines, I have placed further restrictions upon traffic going out. While I've allowed the normal services (ssh, http, https, ftp, mysql, postgres), it is possible I have inadvertently blocked something you need. Please contact me if you know you need other outgoing ports. Thanks, -Jacob