My current research interests are in software engineering and programming languages. I apply data analytics techniques (mining software repositories) to study how developers use programming languages, to help gain insight into features they use and how they use them. I believe future programming languages can be better designed by leveraging data.
I am a lead researcher and engineer for the Boa language and infrastructure. Boa is a virtual laboratory for data-intensive research on open-source software development. Boa provides better abstractions for easily and efficiently mining ultra-large-scale software repositories.
I am also working on PAClab - a Program Analysis ColLABoratory for automatically generating benchmark programs for program analysis researchers.
Recent and Select Publications
- ICSE:
Boa: A Language and Infrastructure for Analyzing Ultra-Large-Scale Software Repositories
Hoan Anh Nguyen,
Hridesh Rajan,
Tien N. Nguyen. May 23, 2013.
Acceptance rate: 85/461 (18.43%)
- MSR:
Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data
Distinguished Paper Award
Samuel W. Flint,
Jigyasa Chauhan,
. May 17, 2021.
Acceptance rate: 48/140 (34.29%)
- MODULARITY:
Modular Reasoning in the Presence of Event Subtyping
Best Paper Award
Mehdi Bagherzadeh,
,
Rex D. Fernando,
José Sánchez,
Hridesh Rajan. March 18, 2015.
Acceptance rate: 12/25 (48.00%)
- NEW ICSE:
Data-Driven Evidence-Based Syntactic Sugar Design
David OBrien,
,
Tien N. Nguyen,
Hridesh Rajan. April 17, 2024.
- NEW ICSE:
Boidae: Your Personal Mining Platform
Brian Sigurdson,
Samuel W. Flint,
. April 17, 2024.
Current Students
- Salomé Perez-Rosero (Ph.D.), co-advised with Witawas Srisa-an
Her research interests are at the intersection of software engineering and machine learning. She uses advanced repository mining, heuristics that cluster the intents of units of software, and ML/DL techniques to study source code patterns that characterize software bugs, vulnerabilities, and code fixes.
Recent Service
- PC Member
– FSE 2025, June 23-27, 2025
ACM International Conference on the Foundations of Software Engineering
- PC Member
– SANER 2025, March 4-7, 2025
32nd IEEE International Conference on Software Analysis, Evolution and Reengineering
- Organizing Committee
– SPLASH 2020, November 15-21, 2020
ACM SIGPLAN conference on Systems, Programming, Languages, and Applications: Software for Humanity