<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<p> </p>
<p style="margin-bottom: 0in; line-height: 100%" align="center">
<font style="font-size: 11pt" size="2"><b>Post-doc position :
Information retrieval for Medical Scientific publications </b></font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Advisors:</b></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
M. Constant (U. of Lorraine). Website :
<a class="moz-txt-link-freetext" href="https://perso.atilf.fr/mconstant/">https://perso.atilf.fr/mconstant/</a></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
M. Clausel (U. Lorraine). Website :
<a class="moz-txt-link-freetext" href="https://sites.google.com/site/marianneclausel/">https://sites.google.com/site/marianneclausel/</a></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
R.S. Stoica (U. Lorraine). Website :
<a class="moz-txt-link-freetext" href="https://sites.google.com/site/radustefanstoica/">https://sites.google.com/site/radustefanstoica/</a></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Other
partners of the project </b>: C. Francois (INIST), P. Oudet
(Cancéopôle Est), F. Schaffner (Cancéropôle Est), N. Thouvenin
(INIST).</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Location:
</b>University of Lorraine, Nancy (France)</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Keywords:
</b>Natural language processing, word embeddings, biomedical
text
mining, graph matching</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Context:</b></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">The
Cancéropôle Est is one of the 7 Cancéropôles created by the
first national cancer action in 2003. Its missions are
organizing,
coordinating, and strengthening research against cancer in
partnership with academic and clinical institutions by
associating
researchers, healthcare professionals, industrials and patients.</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">The
aim of the project is to establish a cartography of the
scientific
research in Oncology in the two French Regions Grand Est and
Bourgogne Franche Comté using the full text of scientific
publications of each research team in the two regions. </font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Description
of the position:</b></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">This
position is funded by AMIES, University of Lorraine and
Canceropôle
Est. With this position, we would like to go use text mining
technics
to extract characteristics related to the scientific content of
the
publications of each research team in Grand Est and Bourgogne
Franche
Comt\'e. </font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">The
recruited person will work on the following points: </font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
Preprocessing of the data. The data will be provided by the
Canc\'eropole Est and will consist of several full texts in xml
or
pdf format.</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
Learning of oncology embedding (see for e.g. [1]). INIST will
provide
training data to learn the embedding and ontology</font></p>
<p style="margin-bottom: 0in; line-height: 100%"> <font
style="font-size: 11pt" size="2">-
Extraction of characteristics related to the scientific content
of
publications for each research team. </font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"> <font
style="font-size: 11pt" size="2">-
Combine these characteristics and collaboration graph of each
team
(see for e.g. [2]) to provide general characteristics for each
team</font></p>
<p style="margin-bottom: 0in; line-height: 100%"> <font
style="font-size: 11pt" size="2">-
Integration in a vizualisation tool</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">The
recruited person will benefit from the expertise of Canceropôle
Est,
INIST and University of Lorraine in text mining and statistical
learning. </font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">We
would ideally like to recruit a 11 month post-doc with the
following
preferred skills:</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
Knowledgeable in natural language processing, text mining and
word
embeddings</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">-
Knowledgeable in machine learning </font>
</p>
<p style="margin-bottom: 0in; line-height: 100%"> <font
style="font-size: 11pt" size="2">-
Good programming skills in Python (classical NLP librairies,
scikit-learn, Pytorch and/or Tensor Flow)</font></p>
<p style="margin-bottom: 0in; line-height: 100%"> <font
style="font-size: 11pt" size="2">-
Very good English (understanding and writing)</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">The
candidates should send a CV, 2 names of referees and a cover
letter
to the researchers mentioned above
(<a class="moz-txt-link-abbreviated" href="mailto:Mathieu.Constant@univ-lorraine.fr">Mathieu.Constant@univ-lorraine.fr</a>,
<a class="moz-txt-link-abbreviated" href="mailto:marianne.clausel@univ-lorraine.fr">marianne.clausel@univ-lorraine.fr</a>,
<a class="moz-txt-link-abbreviated" href="mailto:radu-stefan.stoica@univ-lorraine.fr">radu-stefan.stoica@univ-lorraine.fr</a>). The selected candidates
will be
interviewed in February for an expected start in</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">March/April
2019.</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2"><b>Bibliography</b></font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">[1]
J. Lee et al. BioBERT: a Pre Trained Biomedical Language
Representation Model for Biomedical Text Mining. Ed. Jonathan
Wren.
Bioinformatics (2019).</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><font
style="font-size: 11pt" size="2">[2]
Q. Laporte-Chabasse et al. Morpho-statistical description of
networks
through graph modelling and Bayesian inference. Preprint 2019</font></p>
<p style="margin-bottom: 0in; line-height: 100%"><br>
</p>
<p>
<style type="text/css">p { margin-bottom: 0.1in; line-height: 115%; background: transparent none repeat scroll 0% 0%; }</style></p>
</body>
</html>