Saturday, January 25 • 12:00pm - 12:55pm
package2vec: getting to know PyPI packages with ML

Recently, Bommarito et al. released the paper “An Empirical Analysis of the Python Package Index (PyPI),” which explores many interesting statistics about the Python ecosystem. Can we use machine learning to go beyond pure statistics? This session will discuss how state-of-the-art Natural Language Processing and Graph Neural Network techniques can be applied to gain new insights into packages on PyPI. Specifically, we will detail our approaches to embedding Python packages into learned vector spaces to reveal package similarity and topics within PyPI. In addition, we will discuss the potential applications and benefits of these learned representations, particularly for recommending packages to developers.
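
To make the idea of package similarity in a vector space concrete, here is a minimal sketch in Python. It is not the method presented in the session: the vectors are hand-written toy values rather than learned embeddings, and the names EMBEDDINGS, cosine_similarity, and most_similar are hypothetical. It only illustrates how, once each package has a vector, similar packages can be found by comparing vectors with cosine similarity.

```python
import math

# ASSUMPTION: toy, hand-written vectors for illustration only.
# Real package embeddings would be learned from data (e.g. descriptions
# or dependency graphs) and would have many more dimensions.
EMBEDDINGS = {
    "numpy": [0.9, 0.1, 0.0],
    "scipy": [0.8, 0.2, 0.1],
    "flask": [0.1, 0.9, 0.2],
    "django": [0.0, 0.8, 0.3],
}


def cosine_similarity(a, b):
    """Return the cosine of the angle between vectors a and b."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(package, embeddings, top_k=2):
    """Rank all other packages by cosine similarity to `package`."""
    query = embeddings[package]
    scores = [
        (other, cosine_similarity(query, vector))
        for other, vector in embeddings.items()
        if other != package
    ]
    # Highest similarity first.
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_k]


if __name__ == "__main__":
    for name, score in most_similar("numpy", EMBEDDINGS):
        print(f"{name}: {score:.3f}")
```

With these toy values, the nearest neighbour of "numpy" is "scipy" and the nearest neighbour of "flask" is "django". A recommendation feature could, under the same assumptions, suggest the top-ranked neighbours of packages a developer already uses.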


Devin de Hueck

AI Data Engineering Intern, Red Hat
Interested in all things ML

Saturday January 25, 2020 12:00pm - 12:55pm CET
A112 Faculty of Information Technology Brno University of Technology, Božetěchova, Brno-Královo Pole, Czechia