Skip to content

Plutonic Rainbows

OpenAI Codex

I have switched over to Codex — it’s much cheaper, and for now it seems far more reliable. I’m not running into the problems that have plagued Claude Code over the past month.

I have managed to get Github integration, with Codex loading the appropriate model and permissions. I will probably use Gemini CLI for planning and stick with Codex for a few weeks.

Claude Code Fixed

This is what developers are essentially being told right now. After nearly a month of frankly appalling performance, Anthropic claims to have identified and resolved the issues. Yet the wording of their statement is so vague and non-specific that it offers little reassurance. It doesn’t explain what went wrong, what was actually fixed, or how developers can expect things to improve going forward. Instead, it leaves us with a cloud of ambiguity — an opaque message that feels more like damage control than genuine clarity.

Gail Elliott

Flux.1 [Dev]

Guardrails

I implemented a balanced guardrail system for the Claude Prompt Builder's adaptive complexity engine to address the verbosity concerns while maintaining essential safety checks. The changes have included some important modifying of the adaptive_prompt_builder.py to scale guardrails appropriately: simple tasks (≤800 chars) now receive concise core integrity principles and minimal quality assurance focused on testing, medium tasks (800-2000 chars) get balanced guidance with a condensed runtime priority wrapper and standard QA including lint/typecheck requirements, and complex tasks (2000+ chars) retain comprehensive orchestration with full guardrails. Key improvements involved creating three tiers of core integrity (minimal/balanced/full), implementing scaled quality assurance sections (minimal/standard/comprehensive), adding a concise runtime wrapper for medium complexity, and adjusting verbosity targets to realistic levels.

The system now ensures that even simple fix typo requests include essential testing reminders without overwhelming users with unnecessary orchestration details, while complex multi-domain tasks still receive the comprehensive guidance they require. Testing confirmed that simple tasks reduced from 2000 to 700 characters while preserving critical safety checks, achieving the goal of appropriate scaling without compromising quality control standards.

Adaptive System

I implemented an adaptive complexity system for the Claude Prompt Builder that addresses a critical issue where specialist agents weren't being effectively called for appropriate tasks. The system automatically analyzes user input to classify tasks as simple, medium, or complex, then generates appropriately scaled prompts — from concise 400-character responses for basic requests to comprehensive 2,500+ character structures for complex system design tasks. The core innovation was fixing the restrictive agent delegation logic that was preventing domain experts like security-engineer, python-engineer, and qa-engineer from being recommended when needed.

The implementation required building several new components including the file adaptive_prompt_builder.py (700+ lines), comprehensive configuration management, new API endpoints, and extensive testing frameworks. I maintained full backward compatibility while adding intelligent features like contextual agent triggers, fallback mechanisms, and configurable complexity thresholds. The system now successfully recommends 2+ relevant agents for medium complexity tasks and 5+ specialists with full orchestration for complex projects. Testing showed 100% accuracy in complexity detection and proper agent coordination across all scenarios, restoring the application's effectiveness in guiding users toward appropriate specialist assistance.

Prompt Builder

I'll be honest — I didn't set out to build a prompt engineering tool. Like many developers, I was spending way too much time crafting the perfect prompt for Claude, only to get responses that missed the mark. I'd write something vague like fix this bug and wonder why the AI couldn't read my mind. After watching myself and countless other developers struggle with this same frustration, I realized we needed a bridge between human intent and AI understanding. That's how the Prompt Builder was born — not from grand ambition, but from a simple desire to stop wasting time on prompt trial-and-error. I wanted to transform casual requests into structured, effective prompts that actually got the results we needed.

The architecture I settled on feels almost embarrassingly simple now, but it took several iterations to get right. At its core, the system implements Anthropic's six official prompt engineering techniques, wrapped in a Flask application that processes natural language through multiple enhancement layers. I built an Enhancement Intelligence system that prevents over-engineering simple requests — because nobody needs a 500-word prompt to change a font size. The breakthrough came when I introduced XML-style tag structure in v3.8.0, which creates clear instruction boundaries that dramatically improve how Claude parses complex prompts. I also integrated optional GPT-4o-mini enhancement as a pre-processing layer, essentially using one AI to help communicate better with another AI. The whole thing is held together with dependency injection, regex caching for performance, and a subagent orchestration system that automatically delegates specialized tasks to appropriate AI agents.

Building this tool taught me something unexpected about human-AI interaction: the gap isn't technical, it's communicative. I discovered that most bad AI responses aren't failures of the model, but failures in how we frame our requests. The biggest revelation was realizing that prompt engineering isn't just about getting better outputs — it's about forcing ourselves to think more clearly about what we actually want. When I watch developers use the prompt builder now, they often say the transformed prompt helped them understand their own requirements better. I'm particularly proud that the system has evolved from a simple text transformer into something that embeds Core Integrity Principles — accuracy, professional honesty, and thorough testing — into every generated prompt. It's a small way to make AI interactions more reliable and trustworthy. Honestly, I never expected a side project about prompts to teach me so much about clear communication and systematic thinking.

Reading

This week I am reading Superintelligence: Paths, Dangers, Strategies by Nick Bostrom.

Typography Refinements

I spent some time today refining the typography on this blog after noticing that my section headers and article titles were competing for attention. They had identical visual weight, which made it difficult to distinguish between navigation elements and actual content.

The solution turned out to be fairly straightforward. Rather than making everything lighter, I implemented a progressive weight scale using the Commissioner variable font's range. Article titles remain at weight 600 but are now larger, while subheadings step down to 550, section headers to 500, and supporting elements to 450. This creates a natural reading flow that guides the eye through the content hierarchy.

I also increased the spacing around article titles and tightened their line height for better multi-line rendering. The section headers now have a subtle opacity reduction to further distinguish them from main content.

These changes feel like a meaningful improvement to me. The text no longer fights for attention, and there's a clearer sense of structure when scanning through posts. Sometimes the smallest adjustments can make a significant difference in how content is perceived and consumed.

Frederic Malle - The Night (Again)

Having been disappointed with my first sample, I recently ordered another from a trusted online retailer, thinking perhaps the original had been a faulty batch or somehow diluted.

Unfortunately, this new sample smells almost identical to the first. There is no oud to speak of — just a lingering rose note. For me, the oud simply isn’t present, and I cannot understand how this fragrance has gained such a reputation as an oud powerhouse.

In any case, I will not be purchasing a full bottle. Unless the day comes when I can thoroughly test it in a department store, it will remain a fragrance I have no intention of buying.

Bleaklow by The Stranger

James Leyland Kirby — better known for his haunting work as The Caretaker — ventures into darker territory with 'Bleaklow,' an album that trades his usual deteriorating ballroom memories for the windswept desolation of Northern England's moorlands. This isn't the interior decay of abandoned hotels that The Caretaker explores, but rather the outer spaces where humans can only visit, terrains that exist beyond habitation. The opening track 'Something to Do with Death' sets the tone immediately with unstable, fizzing drones punctuated by pained, high-pitched howls that transmit waves of genuine dread. Throughout its eleven tracks, Kirby masterfully manipulates sound into intimidating monstrosities — formidable sonics and fearsome drones that map out a sonic cartography of the Eerie North West, from the titular Bleaklow plateau to the subterranean cave systems and crags that dot these lonely landscapes.

What makes 'Bleaklow' such a unique jewel in Kirby's extensive catalogue is its complete departure from his sample-based turntable work, instead crafting original dark ambient compositions that feel simultaneously deeply unsettling and oddly comforting. Tracks like 'Indefinite Ridge' create that destabilizing sound that seems to pull you into the moorland itself, while 'Kirkbymoorside' and 'Ominous Sunset' build layers of industrial ambience that feel both majestic and oppressively lugubre. The 2014 remastered edition — with Matt Colton's pristine vinyl mastering and Ivan Seal's new artwork — only emphasizes the album's power, revealing new depths in these already cavernous soundscapes. As critic Mark Fisher noted, Kirby should be understood more as an artist than merely a musician, and 'Bleaklow' stands as testament to this vision — a work that shames the conceptual poverty of much contemporary art through its commitment to place, atmosphere, and the haunting presence of landscapes that remember their dead.