Download Full CV (May, 2020)
Research interests
Applied AI, Data Science, Deep Learning, Machine Learning, Natural Language Processing, Information Retrieval, Health Informatics
Experience
Massachusetts Institute of Technology, USA (Sep 2019 – Now)
Postdoctoral Associate
Qatar Computing Research Institute, Qatar (Mar 2018 – Sep 2019)
Research Associate
Carnegie Mellon University, Qatar (Mar 2017 – May 2017)
Visiting Professor
Joseph Fourier University, France (Jan 2016 – Fev 2016)
Visiting Research Scientist
QUT, Australia (Oct 2015 – Nov 2015)
Visiting Research Scientist
Kconnect, Austria (Jan 2015 – Dec 2016)
Research Assistant
Health On The Net, Switzerland (Feb 2013 – Feb 2013)
Visiting Researcher
Khresmoi, Austria (Jan 2012- Dec 2014)
Research Assistant
Infosys, India (Jan 2011 – Apr 2011)
Internship
Jasper Design Automation, Brazil (2009-2010)
Research and development
Samba Tech, Brazil (2009-2009)
Internship
Universidade Federal de Minas Gerais, Brazil (2005-2008)
Research intern
Education
PhD in Computer Science (2011 – 2019)
Adviser: Allan Hanbury
Co-Adviser: Guido Zuccon
Title: Understandability and Expertise in Consumer Health Search – Retrieving topically relevant and understandable health information on the Web.
MSc in Computer Science (2010 – 2011)
Adviser: Gisele Pappa
Developed a method to assign weights to features for text classification based on Genetic Programming.
BSc in Computer Science (2005 – 2009)
GPA: 3.83 out of 4.00 (96%)
Best of his class.
Software
- TrecTools is an open-source Python library for assisting Information Retrieval (IR) practitioners with TREC-like campaigns. It provides an interface to repetitive and common tasks such as analyzing and evaluating runs, running traditional IR frameworks like Indri and Terrier with different baselines, analyzing results, or even fusing ranking lists to create a more robust run.
- Sleep-Wake Benchmark is a toolkit to process actigraphy data for sleep analysis. It provides the required procedures for data manipulation and a set of traditional and machine learning algorithms for sleep-wake sleep stage classification.
- ReadabilityCalculator is an open-source library that implements the most traditional readability formulas and procedures widely used to estimate the readability of a text.
Languages
Portuguese
Native
English
Advanced
Spanish
Advanced
German
Basic
Technologies
- Machine learning and data analysis: Python, Scikit-learn, Pytorch, TensorFlow, NumPy, SciPy, Pandas, matplotlib, NLTK.
- Information retrieval frameworks: ElasticSearch/Lucene/SOLR, Indri/Lemur, Terrier.
- Favorite text editors: Vim, Jupyter.
- System evaluation: A/B testing, interleaving, IR evaluation in TREC/CLEF.