Files
page-agent/ROADMAP.md
2025-10-24 17:50:57 +08:00

1.9 KiB

🗺️ PageAgent Roadmap

The development progress and future plans for PageAgent.

🚀 Current Works

  • MVP
    • Core functionality implemented.
  • SPA interaction
  • Reasoning and (short) memory
  • Multi model provider integration and testing
  • UI with HITL
    • Human-in-the-loop user interface. Agent can ask user questions.
  • Landing and doc pages
  • Remove ai-sdk
    • Only one function of AI-ADK is being used.
    • Our agent memory and thinking mechanism does not suite ai-sdk.
  • Robust LLM output
    • Auto-fix incomplete output format of DeepSeek and QWen.
  • Working homepage with live LLM API
  • free CDN
  • Free evaluation plan
  • Custom actions and HITL
  • Hooks and Events
    • lifecycle hooks
    • lifecycle events
  • Pause and intervene
    • Current pause does not allow user intervention.
  • Hijack page_open/page_change/page_unload behavior
  • Custom knowledge base and instructions
  • Black/white-list safeguard
  • Data-masking
  • Improve Memory
    • Current phrasing can cause logic-loop for some models.
    • Test adding Action to memory.
  • Tools for more complex tasks
    • todo list
    • file sys
  • Optimize for popular UI frameworks
  • i18n of the website
    • Chinese version
    • English version
  • Testing suits
  • Support custom llm fetch
  • Refactor: Separate Agent and Page-Controller
    • Agent should be able to run w/o dom.
    • Actions should be able to be injected through iframe.

♻️ Following browser-use's update and contribute back.

📋 Pending Features

  • Chrome-ext wrapper
  • Same-origin multi-page-app rally
  • Local MCP proxy

🤔 To Be Decided

  • Cross-origin multi-page?
    • Tricky
    • Need some kind of "memory rally"