Case Western Reserve University · Computer and Data Sciences

reSAID Lab

Responsible Software and AI Design Lab

The reSAID Lab studies the principles and practices for specifying, building, evaluating, and maintaining AI-enabled software systems. We combine empirical software engineering, formal methods, program analysis, and systems-oriented experimentation to understand how LLMs, coding agents, and ML components behave in real development settings. Our goal is to develop rigorous methods and practical tools that improve reliability, fairness, safety, and maintainability across the lifecycle of responsible AI software.

Projects

We study how autonomous coding agents fail during ordinary development work and design safeguards, from constraint enforcement to failure transparency and safe-halt behaviors, for deploying them responsibly.

Testing and analysis for hidden failure modes in large language and vision-language models, from social bias under black-box access to reasoning-level backdoors.

Methods that make language-model reasoning more deliberate, structured, and inspectable, separating high-level planning from low-level action generation.

All projects →

News

Publication

ReShift backdoor paper accepted to ECCV 2026

Our paper proposing ReShift, a reasoning-level backdoor framework for Vision–Language Models, was accepted to ECCV 2026 in Malmö, Sweden. Related paper

Service

Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026)

Sumon Biswas is serving as Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026), November 5-7, 2026, in Arlington, Virginia. Link

Publication

Agentic code safety paper accepted to ASE 2026

Our empirical study characterizing operational safety failures of LLM-based coding agents was accepted to ASE 2026 in Munich, Germany. Related paper

Lab

Ruksaar Shaik defends M.S. project and graduates

Ruksaar Shaik successfully defended her M.S. project and graduated in May 2026. Her work focused on building domain-specific LLM models for detecting intimate partner violence (IPV) from natural-language social media posts.

Talk

Copilot teaching tutorial at the NSF AI Unlocked workshop

Sumon Biswas presented the hands-on tutorial 'Teaching Code-Generation Courses with GitHub Copilot' at the NSF AI Unlocked workshop, hosted by CU Boulder Research Computing with ACCESS and the NAIRR Pilot.

All news →

Recent Publications

What Breaks When LLMs Code? Characterizing Operational Safety Failures of Agentic Code Assistants

Alif Al Hasan, Sumon Biswas

41st IEEE/ACM International Conference on Automated Software Engineering (ASE) 2026

ReShift: Aha-Moment-Driven Reasoning-Level Backdoor Attacks on Vision–Language Models

Zhihao Dou, Qinjian Zhao, Zhiqiang Gao, Sumon Biswas

European Conference on Computer Vision (ECCV) 2026

Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

Zhihao Dou, Qinjian Zhao, Zhongwei Wan, Dinggen Zhang, Weida Wang, Towsif Raiyan, Benteng Chen, Qingtao Pan, Yang Ouyang, Zhiqiang Gao, Shufei Zhang, Sumon Biswas

43rd International Conference on Machine Learning (ICML) 2026

FairSense: Long-Term Fairness Analysis of ML-Enabled Systems

Yining She, Sumon Biswas, Christian Kästner, Eunsuk Kang

47th IEEE/ACM International Conference on Software Engineering (ICSE) 2025

Are Prompt Engineering and TODO Comments Friends or Foes? An Evaluation on GitHub Copilot

David O’Brien, Sumon Biswas, Sayem Imtiaz, Rabe Abdalkareem, Emad Shihab, Hridesh Rajan

46th IEEE/ACM International Conference on Software Engineering (ICSE) 2024

Towards Safe ML-Based Systems in Presence of Feedback Loops

Sumon Biswas, Yining She, Eunsuk Kang

International Workshop on Dependability and Trustworthiness of Safety-Critical Systems with Machine Learned Components, ESEC/FSE (SE4SafeML) 2023

Fix Fairness, Don't Ruin Accuracy: Performance Aware Fairness Repair using AutoML

Giang Nguyen, Sumon Biswas, Hridesh Rajan

31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE) 2023

Towards Understanding Fairness and its Composition in Ensemble Machine Learning

Usman Gohar, Sumon Biswas, Hridesh Rajan

45th IEEE/ACM International Conference on Software Engineering (ICSE) 2023

Fairify: Fairness Verification of Neural Networks

Sumon Biswas, Hridesh Rajan

45th IEEE/ACM International Conference on Software Engineering (ICSE) 2023

23 Shades of Self-Admitted Technical Debt: An Empirical Study on Machine Learning Software

David O’Brien, Sumon Biswas, Sayem Imtiaz, Rabe Abdalkareem, Emad Shihab, Hridesh Rajan

30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (FSE) 2022

All publications →