ipython hangs on Auton cluster

Viraj Mehta virajm at andrew.cmu.edu
Tue Aug 18 22:26:27 EDT 2020


I’m pretty sure it’s not an upstream bug, as many environments (conda and virtualenv) which were working with ipython across several python versions before are now not working. 

I understand that ipython and ipdb aren’t typically required for Python workflows but certain efforts, like stepping through code that requires a GPU and loads a model from the Auton cluster, are difficult to debug without ipdb. Is there anything else that has changed that might have broken it? 

Thanks,
Viraj

> On Aug 18, 2020, at 6:21 PM, Predrag Punosevac <predragp at andrew.cmu.edu> wrote:
> 
> I looked a bit more carefully. It could be an upstream bug. It wouldn't be the first time
> 
> https://github.com/ipython/ipython/issues/11678 <https://github.com/ipython/ipython/issues/11678>
> 
> You don't need ipython to run Python code. You could work and debug your code on your local machine and just run production code on the server. A typical python code is just a script starting with a shebang following with a path to the binaries. I fail to see how ipython could be useful for that. It is surely useful for the interactive work. 
> 
> Predrag
> 
> On Tue, Aug 18, 2020 at 5:45 PM Viraj Mehta <virajm at andrew.cmu.edu <mailto:virajm at andrew.cmu.edu>> wrote:
> Tried this with 3.7 and 3.8 and it still hangs. Also if it’s a good clue, it doesn’t stop even if I send SIGINT or SIGQUIT. Not really sure what’s going on here.
> 
>> On Aug 18, 2020, at 4:39 PM, Viraj Mehta <virajm at andrew.cmu.edu <mailto:virajm at andrew.cmu.edu>> wrote:
>> 
>> Yeah, I’ll give it a shot. Thanks!
>> 
>>> On Aug 18, 2020, at 4:38 PM, Predrag Punosevac <predragp at andrew.cmu.edu <mailto:predragp at andrew.cmu.edu>> wrote:
>>> 
>>> I just upgraded all /opt/conda-py37 and /opt/conda-py38 packages on both GPU9 and GPU11. Could you please try again? Could you also try with py38 which is now recommended and report back. If this works I will upgrade packages across all servers. This could be potentially remotely related to the fact that Ifegenia could not build TensorFlow. Another thought is that the ipython SQLite database is corrupted. 
>>> 
>>> Best,
>>> Predag
>>> 
>>> On Tue, Aug 18, 2020 at 4:34 PM Viraj Mehta <virajm at andrew.cmu.edu <mailto:virajm at andrew.cmu.edu>> wrote:
>>> Hi Predrag,
>>> 
>>> Hope you’re doing well. I’ve been running into an issue the last couple days on the Auton cluster that is blocking my work on code that used to work and was hoping to get your thoughts. I have tried to distill this down to a small but replicable issue, as seen in the attachment, which I have seen hang on the ipython call on GPU9 and GPU11 so far. Do you know why this might be? Thanks.
>>> 
>>> Best,
>>> Viraj
>> 
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.srv.cs.cmu.edu/pipermail/autonlab-users/attachments/20200818/67b47c40/attachment.html>


More information about the Autonlab-users mailing list