PyData Miami 2022

Nelson Correa

Nelson is Founder and CEO of Andinum, Inc., and has 30 years of experience in natural language processing, machine learning and software development. Prior to Andinum, Nelson was data architect and data scientist at Bank of America; postdoctoral researcher, visiting scientist and senior software engineer in natural language processing at IBM Research; Professor in the Department of Electrical Engineering at Universidad de los Andes; and Vice-president of Engineering at two VC-funded startups in New York. Nelson holds a Ph.D. degree in Electrical Engineering and a Masters degree in Mathematics from Syracuse University.

The speaker's profile picture

Sessions

09-22
17:50
30min
Enterprise Semantic Search with Python Large Language Models
Nelson Correa

Enterprise Search is a key use case in big data and business computing. In this talk we introduce Enterprise Semantic Search with Large Language Models (LLMs), and present a working demonstration in the financial domain. Semantic search is search based on meaning representations, instead of literal document and query keywords. We use the recent HuggingFace transformers library, together with related Python libraries (TensorFlow, sklearn and UMAP) for NLP and deep learning. Approaches, data visualization, metrics and datasets for search system evaluation are introduced. The talk will be of interest to developers working on text search and new unstructured data applications. Slides and a demo notebook will be available at the time of PyData Miami 2022.

Main Room