My current research interests are in software engineering and programming languages. I apply data analytics techniques (mining software repositories) to study how developers use programming languages, to help gain insight into features they use and how they use them. I believe future programming languages can be better designed by leveraging data.
I am a lead researcher and engineer for the Boa language and infrastructure. Boa is a virtual laboratory for data-intensive research on open-source software development. Boa provides better abstractions for easily and efficiently mining ultra-large-scale software repositories.
I am also working on PAClab - a Program Analysis ColLABoratory for automatically generating benchmark programs for program analysis researchers.
Recent and Select Publications
- ICSE:
Boa: A Language and Infrastructure for Analyzing Ultra-Large-Scale Software Repositories
Hoan Anh Nguyen,
Hridesh Rajan,
Tien N. Nguyen. May 23, 2013. ,
Acceptance rate: 85/461 (18.43%)
- MSR:
Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data
Distinguished Paper Award
Samuel W. Flint,
Jigyasa Chauhan,
. May 17, 2021.Acceptance rate: 48/140 (34.29%)
- MODULARITY:
Modular Reasoning in the Presence of Event Subtyping
Best Paper Award
Mehdi Bagherzadeh,
,
Rex D. Fernando,
José Sánchez,
Hridesh Rajan. March 18, 2015.Acceptance rate: 12/25 (48.00%)
- NEW
MSR:
Method Chaining Redux: An Empirical Study of Method Chaining in Java, Kotlin, and Python
Ali M. Keshk,
. May 15, 2023.Acceptance rate: 43/115 (37.39%)
- An Empirical Study on the Classification of Python Language Features Using Eye-Tracking Jigyasa Chauhan. December 1, 2022. A master's thesis at University of Nebraska-Lincoln.
- ESEC/FSE: Performing Large-Scale Mining Studies: From Start to Finish (Tutorial) Samuel W. Flint. November 18, 2022. ,
- ESEC/FSE:
An Exploratory Study on the Predominant Programming Paradigms in Python Code
Jigyasa Chauhan. November 16, 2022. ,
Acceptance rate: 99/449 (22.05%)
- EMSE: Pitfalls and Guidelines for Using Time-Based Git Data Samuel W. Flint, Jigyasa Chauhan, . October 6, 2022.
Current Students
- Salomé Perez-RoseroCo-advised with Witawas Srisa-anPh.D.
Her research interests are at the intersection of software engineering and artificial learning, where she uses advanced repository mining, heuristics to cluster intents of units of software, and ML/DL techniques to study source code patterns that characterize software bugs, vulnerabilities and code fixes.
Recent Service
- PC Member, Software Engineering in Society (SEIS)
– ICSE 2024April 12-21, 2024
46th International Conference on Software Engineering - PC Member
– MSR 2024April 15-16, 2024
21st International Conference on Mining Software Repositories - PC Member
– ICPC 2024April 15-16, 2024
32st International Conference on Program Comprehension - PC Member
– ICSME 2024October 6-11, 2024
40th IEEE International Conference on Software Maintenance and Evolution - PC Member
– SANER 2024March 12-15, 2024
31st IEEE International Conference on Software Analysis, Evolution and Reengineering - PC Member
– ASE 2023September 11-15, 2023
38th International Conference on Automated Software Engineering - PC Member
– MSR 2023May 15-16, 2023
20th International Conference on Mining Software Repositories - PC Member
– ICPC 2023May 15-16, 2023
31st International Conference on Program Comprehension - PC Member
– ICSME 2023October 1-6, 2023
39th IEEE International Conference on Software Maintenance and Evolution - PC Member
– SANER 2023March 21-24, 2023
30th IEEE International Conference on Software Analysis, Evolution and Reengineering