Published August 11, 2025 · Tags: AIWorkflow, Automation, RedTeaming
Three days, one AI assistant, and a dozen unexpected outcomes. From automating my home, to unfamiliar food
prep, to building a red-teaming prototype for testing AI safety, these rapid experiments show how versatile
AI can be when you work alongside it with intention.
Read more →
Published July 22, 2025 · Tags: PromptEngineering, GPT-4o, Experiment
What happens when a language model takes on a logic puzzle like Wordle? This experiment runs GPT-4o through
a ReAct scaffold and shows where its reasoning holds, and where its pattern-following snaps. The breaks
matter. They explain why oversight and structure still decide whether AI helps or harms.
Read more →
Published July 02, 2025 · Tags: PromptEngineering, Ethics, GPT-4o, Experiment
How does long-term context change an AI’s moral judgment? We tested GPT-4o with a trolley problem under
different identities and framing. You can watch it bend. Stakes shift when the model is allowed to build a
memory of you.
Read more →
Published April 28, 2025 · Tags: PromptEngineering, LLMs, ReAct
ReAct looks great in papers. We wanted to know if it survives reality, especially on local models. This
walkthrough covers setup, behavior under pressure, and what “reasoning” really means when the model is on
your own hardware.
Read more →
Published April 21, 2025 · Tags: PromptEngineering, LLMs, Reflection
This is the why. The gap between “we rolled out AI” and “our team can actually use it.” What it felt like
being in Support without a playbook, and why Shinros is built around trust, not novelty.
Read more →