<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Sql-Rewrite on Kent Yao</title>
    <link>https://yaooqinn.github.io/tags/sql-rewrite/</link>
    <description>Recent content in Sql-Rewrite on Kent Yao</description>
    <generator>Hugo -- 0.157.0</generator>
    <language>en-us</language>
    <lastBuildDate>Wed, 27 May 2026 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://yaooqinn.github.io/tags/sql-rewrite/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>−46% or −2%? Rule-Based Rewriters Only Work at Home</title>
      <link>https://yaooqinn.github.io/posts/query-engines/rule-rewrite-blindspot-dsb/</link>
      <pubDate>Wed, 27 May 2026 00:00:00 +0000</pubDate>
      <guid>https://yaooqinn.github.io/posts/query-engines/rule-rewrite-blindspot-dsb/</guid>
      <description>On TPC-H 10GB, a state-of-the-art learned rewriter cuts mean execution time from 69.84s to 37.57s — a 46% win. On DSB 10GB, the same rewriter takes 32.62s to 31.93s — a 2.1% non-event. The gap isn&amp;rsquo;t query difficulty; it&amp;rsquo;s whether the benchmark is in the rewriter&amp;rsquo;s training distribution. &amp;ldquo;Rule-based systems are stable and reliable&amp;rdquo; is often a benchmark artifact, not an engineering fact.</description>
    </item>
    <item>
      <title>Just Asking an LLM to Rewrite SQL Does Almost Nothing</title>
      <link>https://yaooqinn.github.io/posts/query-engines/llm-only-rewrite-doesnt-work/</link>
      <pubDate>Tue, 26 May 2026 00:00:00 +0000</pubDate>
      <guid>https://yaooqinn.github.io/posts/query-engines/llm-only-rewrite-doesnt-work/</guid>
      <description>On TPC-H 10GB, asking GPT-4o to rewrite SQL takes mean execution time from 78.81s down to 74.92s — almost nothing. Swap in an open 14B model, feed it plans, add a reward, fine-tune once, and the same workload drops to 29.67s. Whether LLMs can help SQL rewriting is not a question about model strength; it&amp;rsquo;s a question about whether you&amp;rsquo;re willing to give the model the signals it actually needs.</description>
    </item>
  </channel>
</rss>
