From jmjoseph at andrew.cmu.edu Wed Jun 1 13:34:53 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Wed, 01 Jun 2005 13:34:53 -0400 Subject: [auton-users] Cluster/condor meeting Message-ID: <429DF1BD.2020508@andrew.cmu.edu> Hi all. Cluster policy changes are in the works and I'd like to have a chance for user comments and suggestions. The discussion will be primarily focused around Condor usability. I also welcome general comments about your experiences with the cluster machines. If you use or plan to use any of the lab's computing resources, I'd appreciate if you could make it. The decisions made will be announced at the next lab meeting. Monday, June 6th 12 noon NSH 4201 -Jacob From sabhnani+ at cs.cmu.edu Thu Jun 2 10:51:00 2005 From: sabhnani+ at cs.cmu.edu (Robin Sabhnani) Date: Thu, 02 Jun 2005 10:51:00 -0400 Subject: [auton-users] update address book Message-ID: <429F1CD4.3050900@cs.cmu.edu> Hi all, I managed to confuse the SCS's mail server by picking my lastname as the CS username. Now all my mails are getting bounced because there are two users with lastname 'sabhnani'. If you want to send me an email, please send it to sabhnani+ at cs.cmu.edu instead of sabhnani at cs.cmu.edu Thanks, Robin From jmjoseph at andrew.cmu.edu Mon Jun 6 10:16:39 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Mon, 06 Jun 2005 10:16:39 -0400 Subject: [auton-users] lofty restored Message-ID: <42A45AC7.5040001@andrew.cmu.edu> Hi. Between 9:30 and 10:00 this morning, lofty(our primary fileserver) suffered an unexplained outage. I have restored the machine to service and continue to investigate the cause. If you have any continuing issues, please email admin at autonlab.org. -Jacob From jmjoseph at andrew.cmu.edu Mon Jun 6 10:28:15 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Mon, 06 Jun 2005 10:28:15 -0400 Subject: [auton-users] Cluster/condor meeting In-Reply-To: <429DF1BD.2020508@andrew.cmu.edu> References: <429DF1BD.2020508@andrew.cmu.edu> Message-ID: <42A45D7F.7020607@andrew.cmu.edu> Just a reminder... If you have a few spare minutes, I'd love to get your input. -Jacob Jacob Joseph wrote: > Hi all. > Cluster policy changes are in the works and I'd like to have a chance > for user comments and suggestions. The discussion will be primarily > focused around Condor usability. I also welcome general comments about > your experiences with the cluster machines. > > If you use or plan to use any of the lab's computing resources, I'd > appreciate if you could make it. The decisions made will be announced > at the next lab meeting. > > Monday, June 6th > 12 noon > NSH 4201 > > -Jacob From psarkar at cs.cmu.edu Tue Jun 7 17:18:26 2005 From: psarkar at cs.cmu.edu (Purnamrita Sarkar) Date: Tue, 7 Jun 2005 17:18:26 -0400 Subject: [auton-users] Fw: Andrew Moore Elected AAAI Fellow Message-ID: <003e01c56ba6$71c3fce0$ecb90280@sp.cs.cmu.edu> Congratulations , Andrew ! ----- Original Message ----- From: Diane Stidle To: cald-faculty at cs.cmu.edu ; cald-students at cs.cmu.edu Sent: Tuesday, June 07, 2005 3:45 PM Subject: Fwd: Andrew Moore Elected AAAI Fellow Delivered-To: diane+ at ux5.sp.cs.cmu.edu X-Mailer: exmh version 2.3.1 01/18/2001 with nmh-1.0.4 To: admin-sharon at cs.cmu.edu Subject: Andrew Moore Elected AAAI Fellow From: Reid Simmons Date: Tue, 07 Jun 2005 13:11:37 -0400 X-Sender: Reid_Simmons at lana.autonomy.ri.cmu.edu Please join me in congratulating Andrew Moore on his recent election as a fellow of the American Association of Artificial Intelligence. Andrew was selected for "significant contributions to machine learning, data mining, and statistical AI, and for major roles in transferring these technologies to industry and government." He is one of only four fellows elected this year (the others being Usama Fayyad, Ray Mooney, David Smith), and joins 14 other past and present Carnegie Mellon faculty as Fellows of AAAI. Andrew will be honored at a dinner, in conjunction with the National Conference on Artificial Intelligence that is being held in Pittsburgh in July. Congratulations Andrew! Reid -------------- next part -------------- An HTML attachment was scrubbed... URL: From komarek at cmu.edu Wed Jun 15 09:10:57 2005 From: komarek at cmu.edu (Paul Komarek) Date: Wed, 15 Jun 2005 09:10:57 -0400 Subject: [auton-users] Please avoid lop2 until further notice Message-ID: <42B028E1.1060509@cmu.edu> Hi everyone, Summary: don't login to lop2, or use lop2, until further notice. It seems that lop2 might have been added to the condor pool. Even if you are able to login to lop2, please avoid using it until it has been removed from the condor pool. We'll send another message once we've got this figured out, and we'll let you know that it is fine to login again. Hopefully I am wrong, and this is just an unnecessary precaution. -Paul From jmjoseph at andrew.cmu.edu Wed Jun 15 11:04:21 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Wed, 15 Jun 2005 11:04:21 -0400 Subject: [auton-users] Please avoid lop2 until further notice In-Reply-To: <42B028E1.1060509@cmu.edu> References: <42B028E1.1060509@cmu.edu> Message-ID: <42B04375.3070806@andrew.cmu.edu> Please ignore this message. I have been watching lop2 very closely for the past two days or so. It was unused and Condor was very full, so I allowed processing on it to complete a few more jobs. I have full intentions of removing it. If this has affected anyone, I am not aware of it. (Send me email if so.) -Jacob Paul Komarek wrote: > Hi everyone, > > Summary: don't login to lop2, or use lop2, until further notice. > > It seems that lop2 might have been added to the condor pool. Even if > you are able to login to lop2, please avoid using it until it has been > removed from the condor pool. We'll send another message once we've got > this figured out, and we'll let you know that it is fine to login again. > > Hopefully I am wrong, and this is just an unnecessary precaution. > > -Paul From jmjoseph at andrew.cmu.edu Thu Jun 16 13:07:49 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Thu, 16 Jun 2005 13:07:49 -0400 Subject: [auton-users] No jobs on lop1 Message-ID: <42B1B1E5.2000508@andrew.cmu.edu> Hi. As lop1 is responsible for keeping the Condor pool running, please run no jobs on it. Feel free to use it for brief, small memory testing. Lop2 is available for interactive runs. Thanks. -Jacob From jmjoseph at andrew.cmu.edu Mon Jun 20 11:35:38 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Mon, 20 Jun 2005 11:35:38 -0400 Subject: [auton-users] lop1 afs Message-ID: <42B6E24A.5010005@andrew.cmu.edu> I've temporarily stopped AFS on lop1. It seems to be causing a crash which I'm still debugging. lop2 definitely has AFS still if you need it. -Jacob From jmjoseph at andrew.cmu.edu Wed Jun 22 15:28:10 2005 From: jmjoseph at andrew.cmu.edu (Jacob Joseph) Date: Wed, 22 Jun 2005 15:28:10 -0400 Subject: [auton-users] End of Condor Message-ID: <42B9BBCA.3040803@andrew.cmu.edu> Hi. As a few users have discovered, we've been having a couple of technical issues with Condor. Most importantly, certain jobs are evicted with no apparent reason when trying to write the Condor log file. Additionally, some scheduling anomalies have become apparent, leading to queue times longer than should be expected. Both of these issues seem to be related to bugs within Condor. Given that this is a closed-source product, I have contacted the developers a number of times with limited success. These issues could potentially be solved with further effort or through migration to a different batching system. For now, though, it has been decided that no further time will be spent correcting Condor or testing other systems. As such, I am shutting Condor down, effective this evening. Jobs submitted before then will be allowed to finish. -Jacob From mjbaysek at cs.cmu.edu Mon Jun 27 20:37:56 2005 From: mjbaysek at cs.cmu.edu (Michael J. Baysek) Date: Mon, 27 Jun 2005 20:37:56 -0400 Subject: [auton-users] VPN Upgrade 9 PM Message-ID: <42C09BE4.5040308@cs.cmu.edu> All, This evening beginning around 9:00 PM we will be upgrading the VPN software on our firewall. If any of you are still on the system at this time, you will very likely experience disconnection and intermittent delays to your favorite servers. We expect the service interruption to last between 30-60 minutes. We apologize for the interruption. It was decided to perform the upgrade during evening hours, to hopefully impact as few people as possible. If you have any problems accessing the systems after the outage window, please let us know as soon as possible. Another email will follow this one after the upgrade is complete. --- Mike Baysek Systems Analyst Auton Lab From mjbaysek at cs.cmu.edu Tue Jun 28 02:28:29 2005 From: mjbaysek at cs.cmu.edu (Michael J. Baysek) Date: Tue, 28 Jun 2005 02:28:29 -0400 Subject: [auton-users] Re: VPN Upgrade 9 PM In-Reply-To: <42C09BE4.5040308@cs.cmu.edu> References: <42C09BE4.5040308@cs.cmu.edu> Message-ID: <42C0EE0D.4070409@cs.cmu.edu> All, The new software has been running on the server side for some time now, and seems to be running well. If, for some reason, you do encounter problems in the morning, I am in NSH 3123, and I will be in by 9:00 am to help you work through them. However, nobody should notice any difference [from what you are used to] in the morning. Before the end of the week I will need to visit each of your machines to make a few configuration changes to take advantage of the new software. Most of this could be done remotely, but I prefer to do this in person. This will also help me complete my list of information about your systems so I can be properly equipped to support them. Plus, it will give me a chance to meet those of you whom I have not yet met one-on-one. Thank you for your patience during the upgrade. --- Mike Baysek Systems Analyst Auton Lab Michael J. Baysek wrote: > All, > > This evening beginning around 9:00 PM we will be upgrading the VPN > software on our firewall. If any of you are still on the system at > this time, you will very likely experience disconnection and > intermittent delays to your favorite servers. We expect the service > interruption to last between 30-60 minutes. > We apologize for the interruption. It was decided to perform the > upgrade during evening hours, to hopefully impact as few people as > possible. If you have any problems accessing the systems after the > outage window, please let us know as soon as possible. > > Another email will follow this one after the upgrade is complete. > > --- > Mike Baysek > Systems Analyst > Auton Lab > >