<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Posts on reSAID Lab</title><link>https://resaid-lab.github.io/posts/</link><description>Recent content in Posts on reSAID Lab</description><generator>Hugo</generator><language>en-US</language><lastBuildDate>Thu, 11 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://resaid-lab.github.io/posts/index.xml" rel="self" type="application/rss+xml"/><item><title>What Actually Breaks When LLMs Write Code?</title><link>https://resaid-lab.github.io/posts/what-breaks-when-llms-code/</link><pubDate>Thu, 11 Jun 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/posts/what-breaks-when-llms-code/</guid><description>&lt;p&gt;Most conversations about the safety of coding agents revolve around adversarial
scenarios: prompt injection, jailbreaks, malicious instructions hidden in a
README. Those threats are real. But after watching these tools work — and
occasionally watching them wreck a working environment while &amp;ldquo;fixing&amp;rdquo; a unit
test — we kept returning to a more uncomfortable question: what goes wrong when
nobody is attacking, and the agent is simply trying to help?&lt;/p&gt;
&lt;p&gt;Our new preprint, &lt;a href="https://resaid-lab.github.io/publications/agentic-code-safety-preprint-2026/"&gt;What Breaks When LLMs Code?&lt;/a&gt;,
led by our Ph.D. student &lt;a href="https://resaid-lab.github.io/people/alif-al-hasan/"&gt;Alif Al Hasan&lt;/a&gt;, is an attempt
to answer that question with evidence rather than anecdotes. We call this
&lt;em&gt;operational safety&lt;/em&gt;: the safety of an agent during benign, goal-directed,
everyday use.&lt;/p&gt;</description></item><item><title>Teaching LLMs to Plan Before They Act</title><link>https://resaid-lab.github.io/posts/plan-then-action-icml/</link><pubDate>Wed, 10 Jun 2026 00:00:00 +0000</pubDate><guid>https://resaid-lab.github.io/posts/plan-then-action-icml/</guid><description>&lt;p&gt;If you have ever watched a language model reason its way through a hard math
problem, you have probably seen it wander. The chain of thought starts off
promising, circles back on itself, re-derives something it already knew, and
occasionally talks itself out of a correct intermediate result. The final answer
may still be right, but the path there is long, redundant, and hard to trust.&lt;/p&gt;
&lt;p&gt;Our ICML 2026 paper, &lt;a href="https://resaid-lab.github.io/publications/plan-then-action-icml26/"&gt;Plan Then Action&lt;/a&gt;,
starts from a simple diagnosis of why this happens: autoregressive generation is
local. At every step the model decides only what token comes next, so the
reasoning process is essentially a sequence of small, greedy decisions. There is
no global plan — nothing that commits the model to a strategy before it starts
executing one. Tree search and reinforcement learning can partially compensate,
but they are expensive and still operate over the same token-level process.&lt;/p&gt;</description></item></channel></rss>