Stage Conclusion: UI-Tars + RAG = A New Approach for Automated E2E & Exploratory Testing (Part 3)

Articles in this series: Articles in this series: Part 1 - Practical Applications of AI in Test Automation — Context, Demo with UI-Tars LLM & Midscene Part 2 - Data: UI-Tars VS GPT-4o in Midscene Part 3 - Stage Conclusion: UI-Tars + RAG = A New Approach for Automated E2E & Exploratory Testing This is the final article in this series - I will present an example demonstrating: How to integrate UI-Tars' system-reasoning 2 with our locally built RAG using Ollama and LangChain to create a system that understands high-level user instructions and automates execution based on browser screenshots at each stage. Verify the capability of system-2 reasoning after combining the UI-Tars & local RAG 1. End-to-End Demo This Demo uses Miro as an example to demonstrate the capability to handle a non-B2C & complicate system, the AI Agent follows a single user's instruction: "Create a new board with 2 sticky notes and link these 2 sticky notes by a line. " ⚠️VERY IMPORTANT⚠️: The Demo uses Miro's Free Plan, which is allowed for everyone. The test is executed less than 10 times, to verify the stability. The authentication part uses my personal Miro Free account(hardcoded already to avoid any other risks), I strongly ask any readers who want to reproduce this test for any customer-facing products should NOT impact the normal usage of the product, and MUST follow up the policy of the product respectively. (

Feb 26, 2025 - 11:14
 0
Stage Conclusion: UI-Tars + RAG = A New Approach for Automated E2E & Exploratory Testing (Part 3)

Articles in this series:
Articles in this series:

This is the final article in this series -

I will present an example demonstrating:

  • How to integrate UI-Tars' system-reasoning 2 with our locally built RAG using Ollama and LangChain to create a system that understands high-level user instructions and automates execution based on browser screenshots at each stage.
  • Verify the capability of system-2 reasoning after combining the UI-Tars & local RAG

1. End-to-End Demo

This Demo uses Miro as an example to demonstrate the capability to handle a non-B2C & complicate system, the AI Agent follows a single user's instruction: "Create a new board with 2 sticky notes and link these 2 sticky notes by a line. "

⚠️VERY IMPORTANT⚠️: The Demo uses Miro's Free Plan, which is allowed for everyone. The test is executed less than 10 times, to verify the stability. The authentication part uses my personal Miro Free account(hardcoded already to avoid any other risks), I strongly ask any readers who want to reproduce this test for any customer-facing products should NOT impact the normal usage of the product, and MUST follow up the policy of the product respectively.


(