<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>reSAID Lab</title><link>https://resaid-lab.github.io/</link><description>Recent content on reSAID Lab</description><generator>Hugo</generator><language>en-US</language><lastBuildDate>Mon, 16 Nov 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://resaid-lab.github.io/index.xml" rel="self" type="application/rss+xml"/><item><title>Coding Agents and Operational Safety</title><link>https://resaid-lab.github.io/projects/agentic-code-safety/</link><pubDate>Mon, 16 Nov 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/agentic-code-safety/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Autonomous coding agents built on large language models are wired directly into development workflows: they edit files, run commands, configure environments, and fix bugs with growing autonomy. Most safety evaluations of these tools focus on explicitly malicious prompts, but we argue this misses the larger and more common danger: agents that fail during ordinary, goal-directed work through destructive operations, constraint violations, authorization bypasses, and silent errors that surface only after damage is done.&lt;/p&gt;</description></item><item><title>What Breaks When LLMs Code? Characterizing Operational Safety Failures of Agentic Code Assistants</title><link>https://resaid-lab.github.io/publications/agentic-code-safety-preprint-2026/</link><pubDate>Mon, 16 Nov 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/agentic-code-safety-preprint-2026/</guid><description/></item><item><title>ReShift: Aha-Moment-Driven Reasoning-Level Backdoor Attacks on Vision–Language Models</title><link>https://resaid-lab.github.io/publications/reshift-eccv26/</link><pubDate>Tue, 08 Sep 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/reshift-eccv26/</guid><description/></item><item><title>Trustworthy LLMs and VLMs</title><link>https://resaid-lab.github.io/projects/llm-bias-testing/</link><pubDate>Tue, 08 Sep 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/llm-bias-testing/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Large language and vision-language models are deployed in settings where biased,
inconsistent, or manipulated behavior can affect users, yet their internals are
often unavailable or hard to inspect. We develop methods that expose and
characterize such hidden failures, treating trustworthiness as a property that
must be tested for rather than assumed — and connecting each testing method to a
concrete path for mitigation or defense.&lt;/p&gt;
&lt;p&gt;A recurring theme in our work is that trustworthiness must account for a model&amp;rsquo;s
reasoning process, not only its final answer. Attacks and guardrails that operate
on outputs alone tend to leave reasoning traces that are inconsistent or easy to
flag, but as models increasingly expose their chain-of-thought, the reasoning
itself becomes both a new attack surface and a new opportunity for defense. We
study how bias and backdoor threats propagate through model behavior, how to
characterize them with principled signals, and how to build safeguards that hold
up against adaptive adversaries.&lt;/p&gt;</description></item><item><title>Alif Al Hasan</title><link>https://resaid-lab.github.io/people/alif-al-hasan/</link><pubDate>Sat, 25 Jul 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/alif-al-hasan/</guid><description>&lt;h2 id="about"&gt;About&lt;/h2&gt;
&lt;p&gt;Alif Al Hasan is a Ph.D. student in the Department of Computer and Data Sciences at Case Western Reserve University, working under the supervision of Prof. Sumon Biswas at the reSAID Lab. He earned his Bachelor&amp;rsquo;s and Master&amp;rsquo;s degrees in Computer Science and Engineering from Jahangirnagar University. His research operates at the intersection of Software Engineering and AI, focusing on the operational safety of autonomous LLM agents.&lt;/p&gt;
&lt;h2 id="contact"&gt;Contact&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Email:&lt;/strong&gt; &lt;a href="mailto:alifal.hasan@case.edu"&gt;alifal.hasan@case.edu&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Website:&lt;/strong&gt; &lt;a href="https://alifalhasan.github.io/"&gt;alifalhasan.github.io&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;GitHub:&lt;/strong&gt; &lt;a href="https://github.com/alifalhasan"&gt;alifalhasan&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;</description></item><item><title>LLM Reasoning and Planning</title><link>https://resaid-lab.github.io/projects/plan-then-action/</link><pubDate>Mon, 13 Jul 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/plan-then-action/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Large language models can appear to reason, yet generation is autoregressive: each token is chosen from the immediate context, one step at a time. This local view is powerful, but it explains familiar failure modes, such as reasoning that drifts, contradicts itself, takes redundant detours, or commits early to a path that later proves wrong. We study how to make model reasoning globally coherent, efficient, and trustworthy by helping a model decide where it is going before it takes the next step.&lt;/p&gt;</description></item><item><title>Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning</title><link>https://resaid-lab.github.io/publications/plan-then-action-icml26/</link><pubDate>Mon, 13 Jul 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/plan-then-action-icml26/</guid><description/></item><item><title>ReShift backdoor paper accepted to ECCV 2026</title><link>https://resaid-lab.github.io/news/eccv-2026-reshift/</link><pubDate>Thu, 18 Jun 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/eccv-2026-reshift/</guid><description>Our paper proposing ReShift, a reasoning-level backdoor framework for Vision–Language Models, was accepted to ECCV 2026 in Malmö, Sweden.</description></item><item><title>Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026)</title><link>https://resaid-lab.github.io/news/tas-2026-cochair/</link><pubDate>Mon, 15 Jun 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/tas-2026-cochair/</guid><description>Sumon Biswas is serving as Co-Chair of the AAAI Fall Symposium on Trustworthy Agentic Systems (TAS 2026), November 5-7, 2026, in Arlington, Virginia.</description></item><item><title>What Actually Breaks When LLMs Write Code?</title><link>https://resaid-lab.github.io/posts/what-breaks-when-llms-code/</link><pubDate>Thu, 11 Jun 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/posts/what-breaks-when-llms-code/</guid><description>&lt;p&gt;Most conversations about the safety of coding agents revolve around adversarial
scenarios: prompt injection, jailbreaks, malicious instructions hidden in a
README. Those threats are real. But after watching these tools work — and
occasionally watching them wreck a working environment while &amp;ldquo;fixing&amp;rdquo; a unit
test — we kept returning to a more uncomfortable question: what goes wrong when
nobody is attacking, and the agent is simply trying to help?&lt;/p&gt;
&lt;p&gt;Our new preprint, &lt;a href="https://resaid-lab.github.io/publications/agentic-code-safety-preprint-2026/"&gt;What Breaks When LLMs Code?&lt;/a&gt;,
led by our Ph.D. student &lt;a href="https://resaid-lab.github.io/people/alif-al-hasan/"&gt;Alif Al Hasan&lt;/a&gt;, is an attempt
to answer that question with evidence rather than anecdotes. We call this
&lt;em&gt;operational safety&lt;/em&gt;: the safety of an agent during benign, goal-directed,
everyday use.&lt;/p&gt;</description></item><item><title>Teaching LLMs to Plan Before They Act</title><link>https://resaid-lab.github.io/posts/plan-then-action-icml/</link><pubDate>Wed, 10 Jun 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/posts/plan-then-action-icml/</guid><description>&lt;p&gt;If you have ever watched a language model reason its way through a hard math
problem, you have probably seen it wander. The chain of thought starts off
promising, circles back on itself, re-derives something it already knew, and
occasionally talks itself out of a correct intermediate result. The final answer
may still be right, but the path there is long, redundant, and hard to trust.&lt;/p&gt;
&lt;p&gt;Our ICML 2026 paper, &lt;a href="https://resaid-lab.github.io/publications/plan-then-action-icml26/"&gt;Plan Then Action&lt;/a&gt;,
starts from a simple diagnosis of why this happens: autoregressive generation is
local. At every step the model decides only what token comes next, so the
reasoning process is essentially a sequence of small, greedy decisions. There is
no global plan — nothing that commits the model to a strategy before it starts
executing one. Tree search and reinforcement learning can partially compensate,
but they are expensive and still operate over the same token-level process.&lt;/p&gt;</description></item><item><title>Agentic code safety paper accepted to ASE 2026</title><link>https://resaid-lab.github.io/news/ase-2026-agentic-safety/</link><pubDate>Fri, 29 May 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/ase-2026-agentic-safety/</guid><description>Our empirical study characterizing operational safety failures of LLM-based coding agents was accepted to ASE 2026 in Munich, Germany.</description></item><item><title>Copilot teaching tutorial at the NSF AI Unlocked workshop</title><link>https://resaid-lab.github.io/news/ai-unlocked-copilot-tutorial/</link><pubDate>Fri, 15 May 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/ai-unlocked-copilot-tutorial/</guid><description>Sumon Biswas presented the hands-on tutorial &amp;lsquo;Teaching Code-Generation Courses with GitHub Copilot&amp;rsquo; at the NSF AI Unlocked workshop, hosted by CU Boulder Research Computing with ACCESS and the NAIRR Pilot.</description></item><item><title>Ruksaar Shaik defends M.S. project and graduates</title><link>https://resaid-lab.github.io/news/ruksaar-ms-graduation/</link><pubDate>Fri, 15 May 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/ruksaar-ms-graduation/</guid><description>Ruksaar Shaik successfully defended her M.S. project and graduated in May 2026. Her work focused on building domain-specific LLM models for detecting intimate partner violence (IPV) from natural-language social media posts.</description></item><item><title>Ruksaar Shaik presents IPV-detection poster at the CTSC AI Summit</title><link>https://resaid-lab.github.io/news/ruksaar-ctsc-ai-summit-poster/</link><pubDate>Sun, 10 May 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/ruksaar-ctsc-ai-summit-poster/</guid><description>Ruksaar Shaik presented her poster on detecting intimate partner violence (IPV) with large language models at the CTSC AI Summit in May 2026.</description></item><item><title>Plan Then Action accepted to ICML 2026</title><link>https://resaid-lab.github.io/news/icml-2026-pta/</link><pubDate>Wed, 01 Apr 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/icml-2026-pta/</guid><description>Our paper on high-level planning guidance reinforcement learning for LLM reasoning was accepted to ICML 2026 in Seoul, South Korea.</description></item><item><title>General Chair of the LLMTrust workshop</title><link>https://resaid-lab.github.io/news/llmtrust-general-chair/</link><pubDate>Sun, 01 Feb 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/llmtrust-general-chair/</guid><description>Sumon Biswas is serving as General Chair of the International Workshop on Trustworthy Large Language Models for Software Engineering (LLMTrust).</description></item><item><title>Bias Testing and Mitigation in Black Box LLMs using Metamorphic Relations</title><link>https://resaid-lab.github.io/publications/llm-bias-preprint-2026/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/llm-bias-preprint-2026/</guid><description/></item><item><title>UCITE Learning Fellow</title><link>https://resaid-lab.github.io/news/ucite-learning-fellow/</link><pubDate>Mon, 01 Dec 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/ucite-learning-fellow/</guid><description>Sumon Biswas was accepted as a UCITE Learning Fellow at Case Western Reserve University.</description></item><item><title>CTSC pilot grant for AI-assisted dating-violence prevention</title><link>https://resaid-lab.github.io/news/ctsc-ipv-grant/</link><pubDate>Mon, 01 Sep 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/ctsc-ipv-grant/</guid><description>Sumon Biswas was awarded a CWRU CTSC pilot grant to study fairness assessment and improvement for AI-enabled detection of dating violence in youth digital communication, in collaboration with University Hospitals.</description></item><item><title>Avyukth Sai Rangarajan</title><link>https://resaid-lab.github.io/people/avyukth-sai-rangarajan/</link><pubDate>Fri, 15 Aug 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/avyukth-sai-rangarajan/</guid><description/></item><item><title>Devak Pardasani</title><link>https://resaid-lab.github.io/people/devak-pardasani/</link><pubDate>Fri, 15 Aug 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/devak-pardasani/</guid><description/></item><item><title>Phat Dang</title><link>https://resaid-lab.github.io/people/phat-dang/</link><pubDate>Fri, 15 Aug 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/phat-dang/</guid><description/></item><item><title>Ram Aryan Mallampati</title><link>https://resaid-lab.github.io/people/ram-aryan-mallampati/</link><pubDate>Fri, 15 Aug 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/ram-aryan-mallampati/</guid><description/></item><item><title>Sam Lin</title><link>https://resaid-lab.github.io/people/sam-lin/</link><pubDate>Fri, 15 Aug 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/sam-lin/</guid><description/></item><item><title>Zelan Eroz Espanto</title><link>https://resaid-lab.github.io/people/zelan-eroz-espanto/</link><pubDate>Fri, 15 Aug 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/zelan-eroz-espanto/</guid><description/></item><item><title>Invited talk at Kent State AUTOBOT</title><link>https://resaid-lab.github.io/news/kent-state-autobot-talk/</link><pubDate>Tue, 01 Jul 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/kent-state-autobot-talk/</guid><description>Sumon Biswas gave an invited talk, Engineering Responsible AI: From Fairness to Long-term Impact, at the Robotics and Autonomous Systems (AUTOBOT) Program at Kent State University.</description></item><item><title>FairSense: Long-Term Fairness Analysis of ML-Enabled Systems</title><link>https://resaid-lab.github.io/publications/fairsense-icse25/</link><pubDate>Thu, 01 May 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/fairsense-icse25/</guid><description/></item><item><title>Long-Term Fairness and ML Safety</title><link>https://resaid-lab.github.io/projects/fairsense/</link><pubDate>Thu, 01 May 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/fairsense/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Many ML-enabled systems operate in dynamic environments: the system&amp;rsquo;s decisions
change the environment, and those changes feed back into its future inputs. Certain
self-reinforcing loops can amplify errors, entrench bias, and cause fairness
violations in the long term even when immediate outcomes are fair. In predictive
policing, for example, a model that flags a neighborhood as high-crime sends more
patrols there, producing more recorded arrests, which the model reads as even higher
crime. The same pattern appears in loan approvals that affect credit scores and in
medical risk scoring that influences treatment access.&lt;/p&gt;</description></item><item><title>New Faculty Symposium at ICSE 2025</title><link>https://resaid-lab.github.io/news/icse-2025-new-faculty/</link><pubDate>Thu, 01 May 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/icse-2025-new-faculty/</guid><description>Sumon Biswas attended the New Faculty Symposium at ICSE 2025 in Ottawa, Canada.</description></item><item><title>Data science pipelines work featured on New Books Network</title><link>https://resaid-lab.github.io/news/new-books-network-ds-pipelines/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/new-books-network-ds-pipelines/</guid><description>The story behind The Art and Practice of Data Science Pipelines was featured on the New Books Network podcast.</description></item><item><title>FairSense accepted to ICSE 2025</title><link>https://resaid-lab.github.io/news/fairsense-icse-2025/</link><pubDate>Fri, 01 Nov 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/fairsense-icse-2025/</guid><description>Our paper on long-term fairness analysis of ML-enabled systems was accepted to ICSE 2025 in Ottawa, Canada.</description></item><item><title>Ali Nawaf</title><link>https://resaid-lab.github.io/people/ali-nawaf/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/ali-nawaf/</guid><description/></item><item><title>Anika Kaur</title><link>https://resaid-lab.github.io/people/anika-kaur/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/anika-kaur/</guid><description/></item><item><title>Jitong Zou</title><link>https://resaid-lab.github.io/people/jitong-zou/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/jitong-zou/</guid><description/></item><item><title>Khue Luong</title><link>https://resaid-lab.github.io/people/khue-luong/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/khue-luong/</guid><description/></item><item><title>Maximillian Schulten</title><link>https://resaid-lab.github.io/people/maximillian-schulten/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/maximillian-schulten/</guid><description/></item><item><title>Panimalar Gobichettipalayam Annadurai</title><link>https://resaid-lab.github.io/people/panimalar-annadurai/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/panimalar-annadurai/</guid><description/></item><item><title>Ruksaar Shaik</title><link>https://resaid-lab.github.io/people/ruksaar-shaik/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/ruksaar-shaik/</guid><description/></item><item><title>Sharon Sharma</title><link>https://resaid-lab.github.io/people/sharon-sharma/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/sharon-sharma/</guid><description/></item><item><title>Towsif Raiyan</title><link>https://resaid-lab.github.io/people/towsif-raiyan/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/towsif-raiyan/</guid><description/></item><item><title>Zhihao Dou</title><link>https://resaid-lab.github.io/people/zhihao-dou/</link><pubDate>Thu, 15 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/zhihao-dou/</guid><description/></item><item><title>reSAID Lab starts at CWRU</title><link>https://resaid-lab.github.io/news/cwru-faculty-start/</link><pubDate>Thu, 01 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/news/cwru-faculty-start/</guid><description>Sumon Biswas joined Case Western Reserve University as a tenure-track faculty member and started building the reSAID Lab.</description></item><item><title>Sumon Biswas</title><link>https://resaid-lab.github.io/people/sumon-biswas/</link><pubDate>Thu, 01 Aug 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/people/sumon-biswas/</guid><description>&lt;h2 id="about"&gt;About&lt;/h2&gt;
&lt;p&gt;Sumon Biswas is an Assistant Professor in the Department of Computer and Data Sciences at Case Western Reserve University. He received his Ph.D. in Computer Science from Iowa State University and was a Postdoctoral Researcher at Carnegie Mellon University. His research operates at the intersection of Software Engineering and AI, with an emphasis on engineering responsible AI systems using both formal and empirical approaches.&lt;/p&gt;
&lt;h2 id="contact"&gt;Contact&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Email:&lt;/strong&gt; &lt;a href="mailto:sumon@case.edu"&gt;sumon@case.edu&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Office:&lt;/strong&gt; Olin Building 608, Cleveland, OH 44106&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Phone:&lt;/strong&gt; 216.368.1494&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Are Prompt Engineering and TODO Comments Friends or Foes? An Evaluation on GitHub Copilot</title><link>https://resaid-lab.github.io/publications/todo-copilot-icse24/</link><pubDate>Sun, 14 Apr 2024 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/todo-copilot-icse24/</guid><description/></item><item><title>Towards Safe ML-Based Systems in Presence of Feedback Loops</title><link>https://resaid-lab.github.io/publications/safe-ml-fse23/</link><pubDate>Mon, 04 Dec 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/safe-ml-fse23/</guid><description/></item><item><title>Fix Fairness, Don't Ruin Accuracy: Performance Aware Fairness Repair using AutoML</title><link>https://resaid-lab.github.io/publications/fair-automl-fse23/</link><pubDate>Sun, 03 Dec 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/fair-automl-fse23/</guid><description/></item><item><title>Fair-AutoML: Performance-Aware Fairness Repair</title><link>https://resaid-lab.github.io/projects/fair-automl/</link><pubDate>Wed, 15 Nov 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/fair-automl/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Bias mitigation algorithms typically work only in specific situations and often
repair fairness at the cost of large accuracy drops, making them impractical for
critical decision-making software. We treated fairness repair as a
performance-aware optimization problem: fix the bias in a buggy model without
ruining its accuracy.&lt;/p&gt;
&lt;p&gt;Fair-AutoML, built on a state-of-the-art AutoML tool, made this concrete through
two changes to standard AutoML: an optimization function that incorporates
fairness objectives alongside accuracy, and a fairness-aware search space over
candidate model configurations. A search-space pruning method further reduced
computational cost and repair time.&lt;/p&gt;</description></item><item><title>Safety Assurance of ML-Based Systems</title><link>https://resaid-lab.github.io/projects/safety-assurance-ml/</link><pubDate>Wed, 01 Nov 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/safety-assurance-ml/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;ML-based software makes predictions in settings where failures carry real safety consequences. Our motivating case study was the DHS passenger screening challenge, hosted on Kaggle with the largest prize pool in its history ($1.5 million): TSA screens more than two million passengers daily, high false alarm rates create checkpoint bottlenecks, and false negatives pose severe safety risks. We built abstractions of such ML-enabled systems and inferred preconditions that provide probable guarantees on the safety of their predictions.&lt;/p&gt;</description></item><item><title>Fairify: Fairness Verification of Neural Networks</title><link>https://resaid-lab.github.io/projects/fairify/</link><pubDate>Mon, 15 May 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/fairify/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;We built Fairify, an SMT-based approach that verifies individual fairness of deep
neural networks in production. Individual fairness requires that any two individuals
who differ only in protected attributes such as race, sex, or age receive similar
predictions; unlike group metrics, it captures worst-case discrimination. The
property is hard to verify because it must be checked globally over the input
domain and because of the non-linear computation nodes in the network.&lt;/p&gt;</description></item><item><title>Fairify: Fairness Verification of Neural Networks</title><link>https://resaid-lab.github.io/publications/fairify-icse23/</link><pubDate>Sun, 14 May 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/fairify-icse23/</guid><description/></item><item><title>Towards Understanding Fairness and its Composition in Ensemble Machine Learning</title><link>https://resaid-lab.github.io/publications/ensemble-fairness-icse23/</link><pubDate>Sun, 14 May 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/ensemble-fairness-icse23/</guid><description/></item><item><title>Causal Fairness in Machine Learning Pipelines</title><link>https://resaid-lab.github.io/projects/causal-fairness-pipelines/</link><pubDate>Mon, 01 May 2023 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/causal-fairness-pipelines/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Most fairness research treated a machine learning model as a single black box, measuring bias only from its final predictions. Real pipelines, however, contain an ordered set of components — data filtering, imputation, encoding, feature transformation, training, tuning — and each can affect the fairness of the resulting model. We investigated fairness at the component level: using causal reasoning, we intervened on one stage at a time, constructed an alternative pipeline without that stage, and measured the resulting prediction disparity to attribute unfairness to specific components.&lt;/p&gt;</description></item><item><title>23 Shades of Self-Admitted Technical Debt: An Empirical Study on Machine Learning Software</title><link>https://resaid-lab.github.io/publications/ml-techdebt-fse22/</link><pubDate>Mon, 14 Nov 2022 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/ml-techdebt-fse22/</guid><description/></item><item><title>ML Software Maintenance and Technical Debt</title><link>https://resaid-lab.github.io/projects/ml-techdebt/</link><pubDate>Mon, 14 Nov 2022 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/ml-techdebt/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;ML software has distinctive maintenance risks because data, models, pipelines, and code evolve together. Technical debt can infect the data that models are trained on, degrading the functional performance of ML systems in ways traditional debt does not, and the growing inclusion of ML components in modern software introduces new kinds of debt.&lt;/p&gt;
&lt;p&gt;We study how this debt appears in ML repositories in the wild. Mining 68,821 self-admitted technical debts (SATDs) from all revisions of 2,686 mature ML repositories on GitHub, we build taxonomies of ML-specific debt, locate the pipeline stages where it accumulates, and track how it is introduced and removed — evidence developers and researchers can use to build maintainable ML systems.&lt;/p&gt;</description></item><item><title>The Art and Practice of Data Science Pipelines: A Comprehensive Study of Data Science Pipelines In Theory, In-The-Small, and In-The-Large</title><link>https://resaid-lab.github.io/publications/ds-pipeline-icse22/</link><pubDate>Sat, 21 May 2022 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/ds-pipeline-icse22/</guid><description/></item><item><title>Large-Scale Mining of Data Science Software</title><link>https://resaid-lab.github.io/projects/mining-ds-software/</link><pubDate>Sun, 01 May 2022 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/projects/mining-ds-software/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Data science components have become common in software, yet software engineering research on this class of systems needed data and tooling that did not exist. We built an infrastructure to mine data science software from GitHub at scale: we extended the Boa framework to parse Python using ANTLR grammars for Python 2 and 3, transformed the source into ASTs stored in Boa&amp;rsquo;s Protobuf format, and hosted the result on a Hadoop cluster where Boa&amp;rsquo;s domain-specific language runs automatically parallelized queries. The resulting dataset covered 1,558 mature, top-rated data science projects — about 5 million Python file snapshots across all revisions — and was later extended to parse Jupyter notebooks.&lt;/p&gt;</description></item><item><title>Fair Preprocessing: Towards Understanding Compositional Fairness of Data Transformers in Machine Learning Pipeline</title><link>https://resaid-lab.github.io/publications/fair-preprocessing-fse21/</link><pubDate>Mon, 23 Aug 2021 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/fair-preprocessing-fse21/</guid><description/></item><item><title>Do the Machine Learning Models on a Crowd Sourced Platform Exhibit Bias? An Empirical Study on Model Fairness</title><link>https://resaid-lab.github.io/publications/ml-fairness-fse20/</link><pubDate>Sun, 08 Nov 2020 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/ml-fairness-fse20/</guid><description/></item><item><title>Boa Meets Python: A Boa Dataset of Data Science Software in Python Language</title><link>https://resaid-lab.github.io/publications/boa-python-msr19/</link><pubDate>Sun, 26 May 2019 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/publications/boa-python-msr19/</guid><description/></item></channel></rss>