alexanderpanchenko



Personal Websites

Alexander Panchenko

Hello, I am Alexander, an assistant professor for Natural Language Processing (NLP). My main research interest is computational lexical semantics, including word sense embeddings, word sense induction, extraction of lexical resources, and other related topics. I am also interested in argument mining. More generally, I am interested in neural and statistical natural language processing, information retrieval, knowledge bases, machine learning and intersections/interactions of these fields. You can find the list of my publications below on this page and also at Google Scholar.

I am with the Skoltech since 2019. My background is almost a decade of exciting research and developments in the field of NLP: I worked on a range of problems and tasks, such as semantic relatedness, word sense disambiguation, and induction, sentiment analysis, gender detection, taxonomy induction, etc . Before Skoltech, I was a Postdoctoral researcher in the group of Chris Biemann at the University of Hamburg, Germany.  Prior to the appointment in Hamburg, I had a position of Postdoc at TU Darmstadt. I received my PhD in Computational Linguistics from the Université catholique de Louvain, Belgium. During these years, I (co-)authored more than 40 peer-reviewed research publications, including papers in top-tier conference proceedings, such as ACL, EMNLP, EACL, and ECIR receiving (with co-authors) the best paper awards at the “Representation Learning for NLP” (RepL4NLP) workshop at ACL 2016 and SemEval’2019 competition on “Unsupervised Frame Induction”. I co-organised two shared tasks on semantic relatedness and word sense induction evaluation for the Russian language (RUSSE’15 and RUSSE’18). I served also as a co-editor of a data science conference on Analysis of Social Networks, Images, and Texts (AIST) with the proceedings published in Springer LNCS series.

Monographs

  • Alexander Pancheno (2013): Similarity Measures for Semantic Relation Extraction, PhD Thesis, Université catholique de Louvain
  • Alexander Panchenko (2008): Automatic Thesaurus Construction System, Graduation Thesis, Moscow State Technical University (BMSTU)

Edited volumes

Journal articles

Conference proceedings

Workshop proceedings

  • Best paper award at the first workshop on representation learning for NLP (RepL4NLP) at the ACL 2016  conference in Berlin, Germany for the paper “Makings sense of word embeddings“, where an approach for inducing sense embeddings was presented.
  • Best paper award of Fraunhofer IGD and Visual Computing Groups of TU Darmstadt (Darmstadt, Germany) in the category “Impact on the society” for the paper “new/s/leak – Information Extraction and Visualization for Investigative Data Journalists”. The paper presents an NLP system for investigative data journalism which was used by Der Spiegel journal.
  • Best of SemEval-2019: our paper on unsupervised semantic frame induction using BERT was selected as one of the best submissions (with the possibility to present our work orally) at the 13th International Workshop on Semantic Evaluation (SemEval-2019) in  Minneapolis, USA.

Supervision of PhD Theses

  • Özge Sevgili Ergüven (10.2018-…) is co-supervised with Chris Biemann (Germany). Özge is supported by DAAD (Deutscher Akademischer Austauschdienst) and is based at the University of Hamburg. Her PhD research is related to neural entity linking and representation learning on graphs.
  • Saba Anwar (10.2018-…) is co-supervised with Chris Biemann (Germany). Saba is supported by DAAD (Deutscher Akademischer Austauschdiens) and the Higher Education Commission of Pakistan. Her PhD research is related to using of neural language models for unsupervised frame induction.

Supervision of Master and Bachelor Theses

I supervised research-oriented Master theses, usually also aiming to publish a conference paper on the basis of the produced materials.

Internships & Visiting Researchers

I help to write research proposals to funding organizations which let researchers visit our faculty and do interesting short-term research project together.

  • Dmitry Puzyrev (2019): Using hyperbolic word embeddings for detection of noun compositionality. Partially funded by the University of Hamburg. Visit outcome: ACL publication.
  • Shantanu Acharya (2018): Taxonomy induction using word sense representations. Funded by DAAD. Visit outcome: ACL publication.
  • Andrey Kutuzov (2018): Learning graph embeddings via node similarities. Funded by the University of Oslo. Visit outcome: ACL and *SEM publications.
  • Artem Chernodub (2017): Recurrent Neural Networks for Argument Mining. Funded by DAAD. Visit outcome: ACL demo paper.
  • Dmitry Ustalov (2016): Graph Clustering for Word Sense Induction. Funded by DAAD. Visit outcome: ACL and EACL publications.
  • Statistical Natural Language Processing (2019). A course for Master students. Slides. The course is based on the textbook of Jurafsky & Martin and represents a set of topics on (mostly) pre-neural NLP. The course is based on the NLP course of the University of Hamburg.

Organization of Events

Programme Committee for Conferences and Workshops

  • ISCW 2019: International Conference on Computational Semantics (ACL SIGSEM special interest group on semantics)
  • CoNLL 2018, 2019: The SIGNLL Conference on Computational Natural Language Learning
  • ACL 2018, 2019: Annual Meeting of the Association for Computational Linguistics.
  • *SEM 2018, 2019: Joint Conference on Lexical and Computational Semantics
  • NAACL 2018, 2019: North American Chapter of the Association for Computational Linguistics: Human Language Technologies
  • SocInfo 2018: Social Informatics
  • CLL 2018: 3rd Workshop on “Computational linguistics and language science”
  • EMNLP 2017, 2018, 2019: The Conference on Empirical Methods on Natural Language Processing
  • ESWC 2017, 2018: The Semantic Web conference
  • ASSET 2017: Workshop on Advanced Solutions for Semantic Extraction from Texts co-located with the ESWC 2017 conference
  • TextGraphs 2016, 2017, 2018, 2019: Workshop on Text Graph co-located with the ACL/EMNLP/NAACL conferences.
  • ReprL4NLP 2017, 2018: Workshop on Representation Learning for NLP co-located with the ACL conference.
  • SMERP 2017: International Workshop on Exploitation of Social Media for Emergency Relief and Preparedness (co-located with 39th European Conference on Information Retrieval (ECIR 2017)
  • COLING 2016, 2018: International Conference on Natural Language Processing
  • AINL 2015, 2016, 2017: Conference on Artificial Intelligence and Natural Language
  • SEMANTiCS 2016, 2017: International Conference on Semantic Systems
  • Dialogue 2015, 2016, 2017, 2018: International Conference on Computational Linguistics
  • RECITAL 2015, 2016, 2017: Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, co-located with TALN conference
  • NLDB 2015, 2016, 2017: International Conference on Natural Language & Information Systems
  • WI 2014, 2015: IEEE/WIC/ACM International Conference on Web Intelligence
  • RuSSIR 2014, 2015: Young Scientists Conference at Russian Summer School in Information Retrieval
  • AIST 2014, 2015, 2016, 2017, 2018: Conference on Analysis of Images, Social Networks and Texts
  • RANLP 2013, 2015: Conference on Recent Advances in Natural Language Processing
  • LTC 2011, 2013: The Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics

Journal Reviewing

  • Information Processing & Management, Elsevier (2018)
  • Language Resources & Evaluation, Springer (2018, 2019)
  • PLOS ONE (2018)
  • Natural Language Engineering, Cambridge University Press (2018)
  • Data & Knowledge Engineering (DATAK), Elsevier (2017, 2018)
  • International Journal of Artificial Intelligence and Soft Computing, Interscience (2016)
  • Internet Computing Journal, IEEE (2015)
  • International Journal of Child Abuse & Neglect, Elsevier (2014)

Reviewing of Ph.D. Dissertations

  • “Knowledge-based approaches to producing large-scale training data from scratch for Word Sense Disambiguation and Sense Distribution Learning” by Tomasso Passini, Sapienza Università di Roma
  • “Methods for compression of neural networks for natural language processing”, Artem Grachev, Higher School of Economics