We study how autonomous coding agents fail during ordinary development work and design safeguards, from constraint enforcement to failure transparency and safe-halt behaviors, for deploying them responsibly.
Case Western Reserve University · Computer and Data Sciences
reSAID Lab
Responsible Software and AI Design Lab
The reSAID Lab studies the principles and practices for specifying, building, evaluating, and maintaining AI-enabled software systems. We combine empirical software engineering, formal methods, program analysis, and systems-oriented experimentation to understand how LLMs, coding agents, and ML components behave in real development settings. Our goal is to develop rigorous methods and practical tools that improve reliability, fairness, safety, and maintainability across the lifecycle of responsible AI software.
Projects
Testing and analysis for hidden failure modes in large language and vision-language models, from social bias under black-box access to reasoning-level backdoors.
Methods that make language-model reasoning more deliberate, structured, and inspectable, separating high-level planning from low-level action generation.
Simulation-based analysis of long-term fairness and safety in ML-enabled systems whose decisions reshape their own future inputs through feedback loops.
SMT-based verification of individual fairness in neural networks, using input partitioning and sound neural pruning to produce certificates or counterexamples for real-world models.
We study how technical debt appears and evolves in machine learning software, mining self-admitted technical debt at scale to guide the maintenance of ML systems.
News
ReShift backdoor paper accepted to ECCV 2026
Our paper proposing ReShift, a reasoning-level backdoor framework for Vision–Language Models, was accepted to ECCV 2026 in Malmö, Sweden. Related paper
Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026)
Sumon Biswas is serving as Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026), November 5-7, 2026, in Arlington, Virginia. Link
Agentic code safety paper accepted to ASE 2026
Our empirical study characterizing operational safety failures of LLM-based coding agents was accepted to ASE 2026 in Munich, Germany. Related paper
Ruksaar Shaik defends M.S. project and graduates
Ruksaar Shaik successfully defended her M.S. project and graduated in May 2026. Her work focused on building domain-specific LLM models for detecting intimate partner violence (IPV) from natural-language social media posts.
Copilot teaching tutorial at the NSF AI Unlocked workshop
Sumon Biswas presented the hands-on tutorial 'Teaching Code-Generation Courses with GitHub Copilot' at the NSF AI Unlocked workshop, hosted by CU Boulder Research Computing with ACCESS and the NAIRR Pilot.
People

Sumon Biswas
Principal Investigator
Assistant Professor, Department of Computer and Data Sciences, Case Western Reserve University
Ph.D. Students
M.S. Students
Undergraduate Researchers
High School
Recent Publications
What Breaks When LLMs Code? Characterizing Operational Safety Failures of Agentic Code Assistants
41st IEEE/ACM International Conference on Automated Software Engineering (ASE) 2026
ReShift: Aha-Moment-Driven Reasoning-Level Backdoor Attacks on Vision–Language Models
European Conference on Computer Vision (ECCV) 2026
Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
43rd International Conference on Machine Learning (ICML) 2026
FairSense: Long-Term Fairness Analysis of ML-Enabled Systems
47th IEEE/ACM International Conference on Software Engineering (ICSE) 2025
Are Prompt Engineering and TODO Comments Friends or Foes? An Evaluation on GitHub Copilot
46th IEEE/ACM International Conference on Software Engineering (ICSE) 2024
Towards Safe ML-Based Systems in Presence of Feedback Loops
International Workshop on Dependability and Trustworthiness of Safety-Critical Systems with Machine Learned Components, ESEC/FSE (SE4SafeML) 2023
Fix Fairness, Don't Ruin Accuracy: Performance Aware Fairness Repair using AutoML
31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE) 2023
Towards Understanding Fairness and its Composition in Ensemble Machine Learning
45th IEEE/ACM International Conference on Software Engineering (ICSE) 2023
Fairify: Fairness Verification of Neural Networks
45th IEEE/ACM International Conference on Software Engineering (ICSE) 2023
23 Shades of Self-Admitted Technical Debt: An Empirical Study on Machine Learning Software
30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE) 2022






