The solution exists in the financial controls world. Agent = drafter, human = approver. The challenge is very few applications are designed to allow this, Amazon's 1-click checkout is the exact opposite. Writing a proxy for each individual app you give it access to and shimming in your own line of what the agent can do and what it needs approval is a complex and brittle solution.
Yes very funny to see their own model betray them like this:
> The original “non‑profit, open, patents shared” promise now reads almost like an alternate timeline. Today OpenAI is a capped‑profit entity with a massive corporate partner, closed frontier models, and an aggressive product roadmap.
Page 5, "The knowledge cutoff date for Gemini 3 Pro was January 2025."
Still taking nearly a year to train and run post training safety and stability tuning.
With 10x the infrastructure they could iterate much faster, I don't see AI infrastructure as a bubble, it is still a bottleneck on pace of innovation at today's active deployment level.
I'm pretty much doing that in a containerized deployment for project I'm looking to open source soon called webbin:
Technology Stack
Frontend: React 18 + TypeScript + Vite + TailwindCSS
Backend: Node.js + Express + Clustering
Database: PostgreSQL 16 with performance optimization
Cache: Redis 7 with active defragmentation
Security: HTTPS/TLS with container-to-container encryption
Orchestration: Docker Compose with health checks
Monitoring: Built-in APM and performance tracking
Services
webbin-frontend - React TypeScript frontend with HTTPS
webbin-backend - Node.js API with clustering support
webbin-postgres - PostgreSQL 16 with performance tuning
webbin-redis - Redis 7 with advanced caching
webbin-certbot - SSL Certificate management, openssl for dev, LE for production
webbin-testrunner - Testbot
webbin-nginx - Proxy
Access Points
Frontend: https://localhost:5173
Backend API: https://localhost:3001
Health Check: https://localhost:3001/api/health
I'm around 15k LOC, all built in ~80 hours of interactive prompting mostly with Claude 4.0 Sonnet, then some Gemini 2.5 Pro for more devops activities.
"An article by Krebs on 27 March 2018 on KrebsOnSecurity.com about the mining software company and script "Coinhive" where Krebs published the names of admins of the German imageboard pr0gramm, as a former admin is the inventor of the script and owner of the company, was answered by an unusual protest action by the users of that imageboard. Using the pun of "Krebs" meaning "Cancer" in German, they donated to charitable organisations fighting against those diseases, collecting more than 200,000 Euro of donations until the evening of 28 March to the Deutsche Krebshilfe charity".
I believe we'll see the benefits and drawbacks of AI augmentation to humans performing various tasks will vary wildly based on the task, the way the AI is being asked to interact, and the AI model.
English football, unlike tic-tac-toe, can be thrilling and end in a draw. Possession mix, shots on goal, and more stats are useful to determine how exciting a match was from a box score.
Frankly, for me the most boring is a 2-0 win where the team scores those 2 in the first 20 to 30 minutes, swaps to a 5-4-1, and plays tiki-taka passing possession control without trying hard to advance the ball for the remaining hour of the match.
I'm running terminal in one window with AI interaction and then VS Code with project on same directories so I can see via color coding updated or new files to review in the IDE.
How is that better than running your AI interaction in a dedicated toolpane/subwindow directly inside your IDE?
The Chat panel in VS Code has seen a lot of polish, can display full HTML including formatting Markdown nicely, has some fancy displays for AI context such as file links, supports hyperlinks everywhere, and has fancy auto-complete popups for things like @ and # and / mentioned "tools"/"agents"/whatever. Other VS Code widgets can show up in the Chat panel, too. The Chat Panel you can dock in either sidebar and/or float as its own window.
A terminal can do most of those things too, with effort and with nothing quite like the native experience of your IDE and its widgets. It seems like a lesser experience than what VS Code already offers, other than you only have one real choice for AI assistant that supports VS Code's Chat panel (though you still have model choice).
The screenshot of email from DCIO is what should be getting rolled out. This is not suspicious by itself from my perspective. SCuBA is a CISA project that improves security.