INSTITUTIONAL SEMINAR
Seminar: Building Software Project Collections
Dr. Juan Andrés Carruthers, a leading expert in the design of empirical studies in software engineering, shared his vast experience and knowledge of the challenges inherent in creating robust and representative datasets. He emphasized the importance of these collections for the validity and replicability of software research, also presenting the SUM4SOFT process model, a set of best practices developed to optimize this fundamental process.


During the seminar, the application of the Design Science Research methodology in technological research was explored, highlighting its relevance for generating practical knowledge and innovative solutions. The presentation offered valuable insights for those interested in rigorous research and continuous improvement in software development, answering the question of how we can optimize our research practices in this area.
This seminar was held in a hybrid format, allowing for broad participation. The in-person audience met in the Graduate Classroom of the Faculty of Exact and Natural Sciences and Surveying (FACENA – UNNE).
Juan Andrés Carruthers graduated from the Faculty of Exact and Natural Sciences and Surveying, National University of the Northeast. He completed his PhD in Computer Science at UNNE in 2025, becoming the first graduate student to earn a PhD in Computer Science. In this new seminar, he presents the construction of software project collections for Empirical Software Engineering, developed on May 7, 2025.
Dr. Juan Andrés Carruthers holds a PhD in Computer Science and a Bachelor's degree in Information Systems (UNNE). His experience as a CONICET doctoral fellow and his participation in the Software Quality Research Group at FACENA-UNNE have established him as an expert in the design of empirical studies in software engineering. He currently teaches in the Bachelor's program in Information Systems at UNNE.
For those interested in further exploring the subject, we suggest the following publications by Dr. Carruthers:
- A longitudinal study on the temporal validity of software samples
- How are software datasets constructed in empirical software engineering studies? A systematic mapping study
- Open-Source Software Projects: Curating Model for Empirical Software Engineering Studies
This seminar represented an excellent opportunity for the IMIT community and all participants, allowing them to update their knowledge and connect with the latest trends in software engineering research. IMIT thanks Dr. Carruthers for his valuable contribution and all attendees for their active participation.