Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity - METR:
We conduct a randomized controlled trial (RCT) to understand how early-2025 AI tools affect the productivity of experienced open-source developers working on their own repositories. Surprisingly, we find that when developers use AI tools, they take 19% longer than without—AI makes them slower.
I'm seeing plenty of "I told you so" as this makes the rounds. But having spent the past month deep in AI-assisted coding, I find it directly contradicts my experience. Maybe I've drunk too much Kool-Aid, but I don't think I'm entirely delusional. I want to head-scratch through this, though.
Small sample, specific scenario
we recruited 16 experienced developers from large open-source repositories (averaging 22k+ stars and 1M+ lines of code) that they’ve contributed to for multiple years.
That's a rather small sample and a very specific scenario, isn't it?
The researchers themselves acknowledge the limitations:
We caution readers against overgeneralizing on the basis of our results. The slowdown we observe does not imply that current AI tools do not often improve developer’s productivity—we find evidence that the high developer familiarity with repositories and the size and maturity of the repositories both contribute to the observed slowdown, and these factors do not apply in many software development settings. For example, our results are consistent with small greenfield projects or development in unfamiliar codebases seeing substantial speedup from AI assistance.
Maybe this double-negative-enriched bit from the "Key Caveats" section basically jibes with my experience? My recent work (admittedly, a sample of 1) has indeed been largely with greenfield projects and relatively unfamiliar codebases.
It does matter how you use it
The researchers hint at something important:
We expect that AI systems that have higher fundamental reliability, lower latency, and/or are better elicited (e.g. via more inference compute/tokens, more skilled prompting/scaffolding, or explicit fine-tuning on repositories) could speed up developers in our setting (i.e. experienced open-source developers on large repositories).
I think this rhymes with my experience. When I've just charged into a rambling chat & autocomplete session with Cursor, things steer into the ditch early and often.
But when I've worked with Claude Code through a multi-step process of describing the problem, asking the agent to prompt me with clarifying questions, reviewing the problem and considering a solution, breaking it down into parts, and then asking the agent to methodically execute—that's yielded decently reliable success.
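To make that concrete, here's a rough sketch of the kind of session I mean. The prompts are made up for illustration, not a transcript:

```text
1. "Here's the problem: [description of the bug/feature and the relevant files]."
2. "Before writing any code, ask me clarifying questions about anything that's ambiguous."
3. (Answer the questions, then:) "Summarize the problem back to me and propose a solution."
4. "Break that solution into small, independently verifiable steps."
5. "Execute step 1 only, then stop so I can review before we move on."
```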
Waiting, or lack thereof
The study notes:
All else equal, faster AI generations would result in developers being slowed down less. Qualitatively, a minority of developers note that they spend significant time waiting on AI to generate code.
I rarely wait, because I'm juggling multiple projects. When one agent instance is working, I switch to another window. Sometimes it's a separate git worktree of the same codebase. Yes, context switching is tiring, but it also seems to help me overcome ADHD-related activation energy barriers?
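The worktree trick is just stock git. A minimal sketch, with hypothetical paths and branch names:

```bash
# Add a second working copy of the same repo on a new branch,
# so another agent instance can run there without disturbing the first checkout.
git worktree add -b agent-experiment ../myproject-experiment

# ...let the agent work in ../myproject-experiment in another window...

# Clean up the extra checkout when you're done with it.
git worktree remove ../myproject-experiment
```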
Over the years, there've been days when I just sit there staring at the IDE window, poking my brain with a stick saying "c'mon, do something" and nothing happens for an hour or more. I'm not planning my next move, I'm just dissociating. My executive function doesn't, like, function. Often. My own brain makes me wait long periods of time before it starts generating useful results. 😅
Maybe it's the cycling novelty that gets me going? I enjoy task switching between prosing and coding. I enjoy finding that the model appears to have "read" everything—evidenced by it echoing my intent back in code or follow-up questions. I enjoy discovering that while I was in another window, new things happened in the background for me to review.
I've also found that many agents are reliable at handling drudgery. Re-jiggering data structures, applying repeated refactorings, etc. Those tasks can seize me up for tens of minutes at a time with brain-killing waves of tedium. But usually, I can just tell the bot to do it, while I turn to more interesting stuff.
Summing up
One last quote from the study:
Although the influence of experimental artifacts cannot be entirely ruled out, the robustness of the slowdown effect across our analyses suggests it is unlikely to primarily be a function of our experimental design.
This study provides one data point about one specific scenario: experienced developers using specific tools on massive, mature codebases. The researchers themselves caution against overgeneralization, noting that different contexts likely yield different results.
These tools aren't magic and they're not universally beneficial. But dismissing them based on this narrow study would be premature. The key is understanding when, how, and why to use them—something that's still evolving rapidly as both tools and techniques improve.