Online CV

Download Full CV (May, 2020)

Research interests

Applied AI, Data Science, Deep Learning, Machine Learning, Natural Language Processing, Information Retrieval, Health Informatics


Experience


Massachusetts Institute of Technology, USA (Sep 2019 – Now)
Postdoctoral Associate


Qatar Computing Research Institute, Qatar (Mar 2018 – Sep 2019)
Research Associate


Carnegie Mellon University, Qatar (Mar 2017 – May 2017)
Visiting Professor


Joseph Fourier University, France (Jan 2016 – Fev 2016)
Visiting Research Scientist

QUT
QUT, Australia (Oct 2015 – Nov 2015)
Visiting Research Scientist

KConnect_logo_450px
Kconnect, Austria (Jan 2015 – Dec 2016)
Research Assistant

HealthOnTheNet
Health On The Net, Switzerland (Feb 2013 – Feb 2013)
Visiting Researcher

Logo_KHRESMOI_2010-300
Khresmoi, Austria (Jan 2012- Dec 2014)
Research Assistant

Infosys-logo
Infosys, India (Jan 2011 – Apr 2011)
Internship

jasper_logo.pdf
Jasper Design Automation, Brazil (2009-2010)
Research and development

Samba-Tech-Logo-slogan
Samba Tech, Brazil (2009-2009)
Internship

ufmg_logo
Universidade Federal de Minas Gerais, Brazil (2005-2008)
Research intern


Education

TULogo
PhD in Computer Science (2011 – 2019)
Adviser: Allan Hanbury
Co-Adviser: Guido Zuccon
Title: Understandability and Expertise in Consumer Health Search – Retrieving topically relevant and understandable health information on the Web.

ufmg_logo
MSc in Computer Science (2010 – 2011)
Adviser: Gisele Pappa
Developed a method to assign weights to features for text classification based on Genetic Programming.

ufmg_logo
BSc in Computer Science (2005 – 2009)
GPA: 3.83 out of 4.00 (96%)
Best of his class.


Software

  • TrecTools is an open-source Python library for assisting Information Retrieval (IR) practitioners with TREC-like campaigns. It provides an interface to repetitive and common tasks such as analyzing and evaluating runs, running traditional IR frameworks like Indri and Terrier with different baselines, analyzing results, or even fusing ranking lists to create a more robust run.
  • Sleep-Wake Benchmark is a toolkit to process actigraphy data for sleep analysis. It provides the required procedures for data manipulation and a set of traditional and machine learning algorithms for sleep-wake sleep stage classification.
  • ReadabilityCalculator is an open-source library that implements the most traditional readability formulas and procedures widely used to estimate the readability of a text.

Languages

Portuguese

Native

English

Advanced

Spanish

Advanced

German

Basic


Technologies

  • Machine learning and data analysis: Python, Scikit-learn, Pytorch, TensorFlow, NumPy, SciPy, Pandas, matplotlib, NLTK.
  • Information retrieval frameworks: ElasticSearch/Lucene/SOLR, Indri/Lemur, Terrier.
  • Favorite text editors: Vim, Jupyter.
  • System evaluation: A/B testing, interleaving, IR evaluation in TREC/CLEF.