I've been wanting to do terminal recordings via LLMs for testing my projects and dropping in videos etc to PRs - this is like Playwright but for the shell. Would love feedback. Recording the LLM closing Vim was satisfying.
LLMs have been helping me code more rapidly but are instucted at the system level to often be overly helpful, making changes without discussing, adding code withotut removing stale code, trying to anticipate future needs and so on.
You can prompt your LLM or use the MCP server to get it to read this guide that instructs it to follow a 'plan / implement / review' cycle, and has some common patterns and stanards that should be near universal.
I've been using this for a few months and it's greatly improved my productivity, but would love any suggestions.
"The biggest lesson that can be read from 70 years of AI research is that general methods that leverage computation are ultimately the most effective, and by a large margin." - Richard S. Sutton (2019)
For those closer to the edge of research in AI, especially over the last five years, I'd love to know whether you believe this to be valid and whether it is still the case.