Case Western Reserve University · Computer and Data Sciences

reSAID Lab

Responsible Software and AI Design Lab

The reSAID Lab studies the principles and practices for specifying, building, evaluating, and maintaining AI-enabled software systems. We combine empirical software engineering, formal methods, program analysis, and systems-oriented experimentation to understand how LLMs, coding agents, and ML components behave in real development settings. Our goal is to develop rigorous methods and practical tools that improve reliability, fairness, safety, and maintainability across the lifecycle of responsible AI software.

Projects

Coding Agents and Operational Safety

We study how autonomous coding agents fail during ordinary development work, and we design safeguards for deploying them responsibly, from constraint enforcement to failure transparency and safe-halt behaviors.

ASE'26 · ICSE'24

Trustworthy LLMs and VLMs

Testing and analysis for hidden failure modes in large language and vision-language models, from social bias under black-box access to reasoning-level backdoors.

ECCV'26 · arXiv'26

LLM Reasoning and Planning

Methods that make language-model reasoning more deliberate, structured, and inspectable, separating high-level planning from low-level action generation.

ICML'26

Long-Term Fairness and ML Safety

Simulation-based analysis of long-term fairness and safety in ML-enabled systems whose decisions reshape their own future inputs through feedback loops.

ICSE'25

Fairify: Fairness Verification of Neural Networks

SMT-based verification of individual fairness in neural networks, using input partitioning and sound neural pruning to produce certificates or counterexamples for real-world models.

ICSE'23

ML Software Maintenance and Technical Debt

We study how technical debt appears and evolves in machine learning software, mining self-admitted technical debt at scale to guide the maintenance of ML systems.

FSE'22

All projects →

News

Jun 2026

Publication

ReShift backdoor paper accepted to ECCV 2026

Our paper proposing ReShift, a reasoning-level backdoor attack framework for vision-language models, was accepted to ECCV 2026 in Malmö, Sweden. Related paper

Jun 2026

Service

Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026)

Sumon Biswas is serving as Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026), November 5–7, 2026, in Arlington, Virginia. Link

May 2026

Publication

Agentic code safety paper accepted to ASE 2026

Our empirical study characterizing operational safety failures of LLM-based coding agents was accepted to ASE 2026 in Munich, Germany. Related paper

May 2026

Lab

Ruksaar Shaik defends M.S. project and graduates

Ruksaar Shaik successfully defended her M.S. project and graduated in May 2026. Her work focused on building domain-specific LLMs for detecting intimate partner violence (IPV) in natural-language social media posts.

May 2026

Talk

Copilot teaching tutorial at the NSF AI Unlocked workshop

Sumon Biswas presented the hands-on tutorial "Teaching Code-Generation Courses with GitHub Copilot" at the NSF AI Unlocked workshop, hosted by CU Boulder Research Computing with ACCESS and the NAIRR Pilot.

All news →

People

Sumon Biswas

Principal Investigator

Assistant Professor, Department of Computer and Data Sciences, Case Western Reserve University

Ph.D. Students

Alif Al Hasan

Ph.D. Student

Jitong Zou

Ph.D. Student

Zhihao Dou

Ph.D. Student

M.S. Students

Panimalar Gobichettipalayam Annadurai

M.S. Student

Towsif Raiyan

M.S. Student

Undergraduate Researchers

Avyukth Sai Rangarajan

Undergraduate Researcher

Devak Pardasani

Undergraduate Researcher

Phat Dang

Undergraduate Researcher

Ram Aryan Mallampati

Undergraduate Researcher

Sam Lin

Undergraduate Researcher

Ali Nawaf

Undergraduate Researcher

Anika Kaur

Undergraduate Researcher

Khue Luong

Undergraduate Researcher

Maximillian Schulten

Undergraduate Researcher

High School

Sharon Sharma

K-12 Intern

Recent Publications

What Breaks When LLMs Code? Characterizing Operational Safety Failures of Agentic Code Assistants

Alif Al Hasan, Sumon Biswas

41st IEEE/ACM International Conference on Automated Software Engineering (ASE) 2026

arXiv

ReShift: Aha-Moment-Driven Reasoning-Level Backdoor Attacks on Vision–Language Models

Zhihao Dou, Qinjian Zhao, Zhiqiang Gao, Sumon Biswas

European Conference on Computer Vision (ECCV) 2026

arXiv

Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

Zhihao Dou, Qinjian Zhao, Zhongwei Wan, Dinggen Zhang, Weida Wang, Towsif Raiyan, Benteng Chen, Qingtao Pan, Yang Ouyang, Zhiqiang Gao, Shufei Zhang, Sumon Biswas

43rd International Conference on Machine Learning (ICML) 2026

arXiv

FairSense: Long-Term Fairness Analysis of ML-Enabled Systems

Yining She, Sumon Biswas, Christian Kästner, Eunsuk Kang

47th IEEE/ACM International Conference on Software Engineering (ICSE) 2025

DOI

Are Prompt Engineering and TODO Comments Friends or Foes? An Evaluation on GitHub Copilot

David O’Brien, Sumon Biswas, Sayem Imtiaz, Rabe Abdalkareem, Emad Shihab, Hridesh Rajan

46th IEEE/ACM International Conference on Software Engineering (ICSE) 2024

DOI · Data

Towards Safe ML-Based Systems in Presence of Feedback Loops

Sumon Biswas, Yining She, Eunsuk Kang

International Workshop on Dependability and Trustworthiness of Safety-Critical Systems with Machine Learned Components, ESEC/FSE (SE4SafeML) 2023

DOI

Fix Fairness, Don't Ruin Accuracy: Performance Aware Fairness Repair using AutoML

Giang Nguyen, Sumon Biswas, Hridesh Rajan

31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE) 2023

DOI

Towards Understanding Fairness and its Composition in Ensemble Machine Learning

Usman Gohar, Sumon Biswas, Hridesh Rajan

45th IEEE/ACM International Conference on Software Engineering (ICSE) 2023

DOI

Fairify: Fairness Verification of Neural Networks

Sumon Biswas, Hridesh Rajan

45th IEEE/ACM International Conference on Software Engineering (ICSE) 2023

DOI

23 Shades of Self-Admitted Technical Debt: An Empirical Study on Machine Learning Software

David O’Brien, Sumon Biswas, Sayem Imtiaz, Rabe Abdalkareem, Emad Shihab, Hridesh Rajan

30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE) 2022

DOI

All publications →