About 52 results
Open links in new tab
  1. GitHub - Tongyi-MAI/MAI-UI: MAI-UI: Real-World Centric Foundation …

    Dec 29, 2025 · MAI-UI Mobile is a foundation GUI agent developed by Alibaba Cloud and licensed under the Apache License (Version 2.0). This product contains various third-party components under other …

  2. GitHub - clash-verge-rev/clash-verge-rev: A modern GUI client based …

    A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience - clash-verge-rev/clash-verge-rev

  3. showlab/Awesome-GUI-Agent - GitHub

    💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. - showlab/Awesome-GUI-Agent

  4. GitHub - ritzz-ai/GUI-R1: Official implementation of GUI-R1 : A ...

    Mar 10, 2025 · By leveraging a small amount of carefully curated high-quality data across multiple platforms (including Windows, Linux, MacOS, Android, and Web) and employing policy optimization …

  5. GitHub - microsoft/GUI-Actor: [NeurIPS'25] GUI-Actor: Coordinate-Free ...

    Jun 3, 2025 · The attention-based action head not only enables GUI-Actor to perform coordinate-free GUI grounding that more closely aligns with human behavior, but also can generate multiple …

  6. GitHub - stepfun-ai/gelab-zero: STEP-GUI: The top GUI agent solution …

    Nov 30, 2025 · While GUI-based solutions offer universal compatibility, the fragmentation of mobile ecosystems imposes heavy engineering burdens that hinder innovation. GELab-Zero is designed to …

  7. GitHub - OpenBMB/AgentCPM-GUI: AgentCPM-GUI: An on-device GUI …

    May 13, 2025 · AgentCPM-GUI is an open-source on-device LLM agent model jointly developed by THUNLP, Renmin University of China and ModelBest. Built on MiniCPM-V with 8 billion parameters, …

  8. Scrcpy GUI - GitHub

    Scrcpy was created by the team behind the popular Android emulator Genymotion, but it is not an Android emulator itself, it displays and controls Android devices connected via USB or TCP/IP, it …

  9. Pioneering Automated GUI Interaction with Native Agents

    Recommended for: GUI tasks on desktop environments such as Windows, Linux, or macOS. Features: Supports common desktop operations: mouse clicks (single, double, right), drag actions, keyboard …

  10. GitHub - PDFMathTranslate/PDFMathTranslate: [EMNLP 2025 Demo] …

    Scientific PDF document translation preserving layouts. 📊 Preserve formulas, charts, table of contents, and annotations. 🌐 Support multiple languages, and diverse translation services. 🤖 Provides …