🗺️ PageAgent Roadmap
The development progress and future plans for PageAgent.
🚀 Current Works
- MVP
- Core functionality implemented.
- SPA interaction
- Reasoning and (short) memory
- Multi model provider integration and testing
- UI with HITL
- Human-in-the-loop user interface. Agent can ask user questions.
- Landing and doc pages
- Remove ai-sdk
- Only one function of ai-sdk is being used.
- Our agent memory and thinking mechanism does not suit ai-sdk.
- Robust LLM output
- Auto-fix the incomplete output format of DeepSeek and Qwen.
- Working homepage with live LLM API
- Free CDN
- Free evaluation plan
- Custom actions and HITL
- Hooks and Events
- lifecycle hooks
- lifecycle events
- ❗Pause and intervene
- The current pause does not allow user intervention.
- ❗Hijack page_open/page_change/page_unload behavior
- Custom knowledge base and instructions
- Black/white-list safeguard
- Data-masking
- Optimize for popular UI frameworks
- Test suites
- Support custom LLM fetch
- Refactor: Separate Agent and Page-Controller
- Agent should be able to run without a DOM.
- Actions should be able to be injected through iframe.
♻️ Follow browser-use's updates and contribute back.
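As an illustration of the "Robust LLM output" item above, here is a minimal sketch of one way to auto-fix a JSON reply that was cut off mid-stream. It is not PageAgent's actual implementation: the function name `repairTruncatedJson` and the repair strategy (close an open string, drop a dangling comma, close open brackets) are assumptions made for this example.

```typescript
// Hypothetical helper, not part of PageAgent: tries to parse a possibly
// truncated JSON string by closing whatever the model left open.
function repairTruncatedJson(raw: string): unknown {
  let text = raw.trim();
  try {
    return JSON.parse(text); // already valid, nothing to repair
  } catch {
    // fall through to the repair path
  }

  const closers: string[] = []; // closing brackets still owed, innermost last
  let inString = false;
  let escaped = false;

  for (const ch of text) {
    if (inString) {
      if (escaped) escaped = false;
      else if (ch === "\\") escaped = true;
      else if (ch === '"') inString = false;
      continue;
    }
    if (ch === '"') inString = true;
    else if (ch === "{") closers.push("}");
    else if (ch === "[") closers.push("]");
    else if (ch === "}" || ch === "]") closers.pop();
  }

  if (inString) {
    // Drop a lone trailing backslash so the added quote is not escaped.
    if (escaped) text = text.slice(0, -1);
    text += '"';
  } else {
    // Remove a dangling comma such as `[1,2,`.
    text = text.replace(/,\s*$/, "");
  }

  // Close open containers from the innermost outward.
  text += closers.reverse().join("");

  // May still throw, e.g. when the cut happened right after a key or a colon.
  return JSON.parse(text);
}
```

For example, `repairTruncatedJson('{"action":"click","args":{"index":3')` parses as an object whose `args.index` is `3`. Inputs cut off right after a key or a colon are not recoverable this way and still throw, so a caller would likely fall back to retrying the request.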
📋 Pending Features
- Chrome-ext wrapper
- Same-origin multi-page-app rally
- Local MCP proxy
🤔 To Be Decided
- Cross-origin multi-page?
- Tricky
- Need some kind of "memory rally"