Huy Vu

PhD Candidate in Computer Science

Stony Brook University


I am a PhD candidate in Computer Science at Stony Brook University. I am grateful and proud to work with my advisor, Professor H. Andrew Schwartz, in the Human Language Analysis Beings lab (HLAB). My main research interests are Machine Learning, especially Natural Language Processing, and its application to Psychology and Social Science research, a sweet spot at the intersection of understanding machines and understanding humans. Having worked across disciplines with natural and social scientists, I believe that the more scientific fields intersect, the more novel and exciting the findings will be.

I also love creating art as a hobby through mediums like sketching, AR/VR (augmented and virtual reality) experiences, and especially dancing. Check out my creative works here: Huy Vu’s creative playground


  • Natural Language Processing
  • Computational Social Science
  • Machine Learning


  • PhD candidate in Computer Science, 2018 - 2023

    Stony Brook University

  • BSc in Mathematics and Computer Science, 2012 - 2017

    University of Science Ho Chi Minh City



Meta (Facebook)

Machine Learning Engineer Intern

Jun 2022 – Aug 2022 California, U.S.

Responsibilities include:

  • Creating synthetic training data using a Text-to-Speech pipeline to improve an Automatic Speech Recognition (ASR) model. The main goal is to reduce the performance gaps across age groups, making the model perform more evenly across different demographics.
  • Reduced the performance gaps in the ASR task across age groups by up to 13 percent by reviewing the literature to find the right dataset to create synthetic data from, and by analyzing the distributions of speed and pitch features in real data to make sure the generated augmented data closely simulates the real data.


NVIDIA

Deep Learning Engineer Intern

Jun 2021 – Aug 2021 California, U.S.

Responsibilities include:

  • Surveying the literature and implementing strong models such as Distill-FiD for the Open-Domain Question Answering task. Tuning and accelerating models (running on NVIDIA’s DGX A100 and Selene supercomputer) to minimize computational costs.
  • Significantly reduced total training time for the Distill-FiD pipeline using a variety of optimization techniques, including half-precision training, parallelized data processing, and a fused Adam optimizer, among others.

Amazon Alexa AI

Applied Scientist Intern

Jun 2020 – Aug 2020 Washington, U.S.

Responsibilities include:

  • Working on distilling strong but heavy transformer-based language models into more compact, fast-running models while maintaining reasonable performance.
  • Proposed a novel approach to take advantage of large models’ performance (RoBERTa, XLM) in real time on devices with small computing capability.

Brookhaven National Lab

Research Assistant

Sep 2019 – May 2020 New York, U.S.

Responsibilities include:

  • Working with material scientists to implement text mining algorithms for BNL’s material science literature database.
  • Built a search engine using BERT-based contextualized embeddings specifically for material science literature research, which received highly rated feedback from material scientists.

University of Pennsylvania

Research Assistant Intern

Jun 2019 – Aug 2019 Pennsylvania, U.S.

Responsibilities include:

  • Working with psychologists to explore the novel idea of applying NLP tools to psychology research, using text embeddings to analyze personality questionnaires.
  • Validated the proposed hypotheses, with correlations up to 0.421 (significant in the field of psychology). Paper accepted at Findings of EMNLP 2020.

Omn1Solution (Salesforce’s official partner in Vietnam)

Business Analyst

Sep 2017 – Mar 2018 Ho Chi Minh City, Vietnam

Responsibilities include:

  • Assisting in analyzing customers’ needs and designing system solutions
  • Implementing and deploying the Salesforce Sales Cloud system

Ho Chi Minh City University of Science

Research Assistant

Aug 2016 – Dec 2017 Ho Chi Minh City, Vietnam

Responsibilities include:

  • Proposing a novel method for segmenting brain images using a combined Gaussian Mixture Model and Deep Learning approach.
  • Achieved a jump of 8.6 points (out of 100) over state-of-the-art methods; the work was accepted to WACV 2017.


Merit Fellowship $5000 2018 – 2019

Completion of Business Administration Course

Exchange Program Scholarship 2014 – 2015

National Program for the Development of Mathematics 2013 and 2016

Excellence in Academic Activities 2013

KumHo Asiana Scholarship 2012

For 1st-ranked students in the university’s entrance exam