guicountrol

Control the Linux desktop GUI using xdotool, wmctrl, and dogtail. Use when you need to interact with non-browser applications, simulate mouse/keyboard input, manage windows, or inspect the UI hierarchy of applications on X11/GNOME. Supports: (1) Clicking/typing in apps, (2) Resizing/moving windows, (3) Extracting text-based UI trees from apps (A11y), (4) Taking screenshots for visual analysis.
Install via ClawdBot CLI:
clawdbot install dreamtraveler13/guicountrol

This skill provides tools and procedures for automating interactions with the Linux desktop environment.
Use wmctrl to find the exact name of the window you want to control.
wmctrl -l
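To pick a window by name programmatically, the output of wmctrl -l can be parsed. The sketch below assumes the usual four-column layout (window ID, desktop number, client host, title). The names Window, parse_wmctrl_list, find_window, list_windows, and the SAMPLE text are illustrative and not part of the skill.

```python
import subprocess
from typing import List, NamedTuple, Optional


class Window(NamedTuple):
    window_id: str
    desktop: int   # -1 means the window is sticky (shown on all desktops)
    host: str
    title: str


def parse_wmctrl_list(output: str) -> List[Window]:
    """Parse the text printed by `wmctrl -l` into Window records."""
    windows = []
    for line in output.splitlines():
        # Split into at most 4 fields so titles keep their inner spaces.
        parts = line.split(None, 3)
        if len(parts) < 3:
            continue  # skip blank or malformed lines
        window_id, desktop, host = parts[:3]
        title = parts[3].rstrip() if len(parts) == 4 else ""
        windows.append(Window(window_id, int(desktop), host, title))
    return windows


def find_window(windows: List[Window], needle: str) -> Optional[Window]:
    """Return the first window whose title contains needle (case-insensitive)."""
    needle = needle.lower()
    for window in windows:
        if needle in window.title.lower():
            return window
    return None


def list_windows() -> List[Window]:
    """Run wmctrl on a live X11 session (requires wmctrl to be installed)."""
    result = subprocess.run(
        ["wmctrl", "-l"], capture_output=True, text=True, check=True
    )
    return parse_wmctrl_list(result.stdout)


# Hypothetical sample output, for illustration only.
SAMPLE = """\
0x02000003  0 myhost Terminal
0x03a00007 -1 myhost Desktop
0x04200004  1 myhost notes.txt - Text Editor
"""
```

The exact title returned by find_window can then be passed to the activate action.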
For apps supporting accessibility (GNOME apps, Electron apps with --force-renderer-accessibility), use the inspection script to find button names without taking screenshots.
python3 scripts/inspect_ui.py "<app_name>"
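The contents of scripts/inspect_ui.py are not shown here, so the following is only a sketch of how a text-based UI tree might be produced. It assumes each node exposes name, roleName, and children attributes, as dogtail nodes do. The helper names walk, format_tree, find_by_role, and the FAKE_APP tree are hypothetical.

```python
from types import SimpleNamespace
from typing import Iterator, List, Tuple


def walk(node, depth: int = 0) -> Iterator[Tuple[int, str, str]]:
    """Yield (depth, role, name) for a node and all of its descendants."""
    yield depth, getattr(node, "roleName", ""), getattr(node, "name", "")
    for child in getattr(node, "children", []):
        yield from walk(child, depth + 1)


def format_tree(node) -> str:
    """Render the tree as indented text, one element per line."""
    return "\n".join(
        f"{'  ' * depth}[{role}] {name}".rstrip()
        for depth, role, name in walk(node)
    )


def find_by_role(node, role: str) -> List[str]:
    """Collect the names of all named elements with the given role."""
    return [name for _, r, name in walk(node) if r == role and name]


# Stand-in tree for illustration. On a live session with dogtail installed,
# a real node could be obtained with something like:
#   from dogtail.tree import root
#   app = root.application("<app_name>")
FAKE_APP = SimpleNamespace(name="editor", roleName="application", children=[
    SimpleNamespace(name="Save", roleName="push button", children=[]),
    SimpleNamespace(name="", roleName="text", children=[]),
])

print(format_tree(FAKE_APP))
```

Filtering by role (for example "push button") gives the button names that can be targeted without a screenshot.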
Use xdotool via the helper script for common actions.
# Activate window
./scripts/gui_action.sh activate "<window_name>"
# Click coordinates
./scripts/gui_action.sh click 500 500
# Type text
./scripts/gui_action.sh type "Hello World"
# Press a key
./scripts/gui_action.sh key "Return"
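The helper script itself is not reproduced here. As a rough guide, the four actions above could map onto xdotool invocations as in this sketch. The mapping is an assumption about what gui_action.sh does, and build_xdotool_argv is a hypothetical name.

```python
import subprocess
from typing import List


def build_xdotool_argv(action: str, *args: str) -> List[str]:
    """Translate a helper-style action into an xdotool argument list.

    This mapping is an assumption, not the actual gui_action.sh logic.
    """
    if action == "activate":
        (window_name,) = args
        # Search windows by title and activate the match.
        return ["xdotool", "search", "--name", window_name, "windowactivate"]
    if action == "click":
        x, y = args
        # Move the pointer, then press the left mouse button (button 1).
        return ["xdotool", "mousemove", str(int(x)), str(int(y)), "click", "1"]
    if action == "type":
        (text,) = args
        return ["xdotool", "type", text]
    if action == "key":
        (key,) = args
        return ["xdotool", "key", key]
    raise ValueError(f"unknown action: {action!r}")


def run_action(action: str, *args: str) -> None:
    """Execute the action on a live X11 session (requires xdotool)."""
    subprocess.run(build_xdotool_argv(action, *args), check=True)
```

Building the argument list separately from running it keeps the mapping easy to inspect and avoids shell quoting problems with window names that contain spaces.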
1. List open windows with wmctrl -l.
2. Run scripts/inspect_ui.py to get the list of buttons and inputs.
3. Use xdotool key Tab and Return to navigate, or click if coordinates are known.

Many modern apps (VS Code, Discord, Cider, Chrome) need a flag to expose their UI tree:
pkill <app>
nohup <app> --force-renderer-accessibility > /dev/null 2>&1 &
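The same restart can be scripted. This is a minimal sketch that mirrors the two commands above. The function names are hypothetical, and it assumes the application's executable name is also the process name that pkill matches.

```python
import subprocess
from typing import List, Sequence

A11Y_FLAG = "--force-renderer-accessibility"


def relaunch_argv(app: str, extra_args: Sequence[str] = ()) -> List[str]:
    """Build the launch command, adding the accessibility flag once."""
    argv = [app, *extra_args]
    if A11Y_FLAG not in argv:
        argv.append(A11Y_FLAG)
    return argv


def relaunch_with_a11y(app: str, extra_args: Sequence[str] = ()) -> subprocess.Popen:
    """Stop running instances of app, then start it detached with the flag."""
    # pkill exits non-zero when nothing matched, so do not treat that as fatal.
    subprocess.run(["pkill", app], check=False)
    return subprocess.Popen(
        relaunch_argv(app, extra_args),
        stdin=subprocess.DEVNULL,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        start_new_session=True,  # detach from this process, similar to nohup
    )
```

Because pkill only sends a signal, the old instance may still be shutting down when the new one starts. A short wait between the two steps may be needed for some apps.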
Generated Mar 1, 2026
This scenario involves using the skill to automate functional testing of Linux desktop applications by simulating user interactions like clicking buttons, typing text, and verifying UI elements. It is ideal for QA teams to run regression tests on non-browser apps such as office suites or media players, reducing manual effort and improving test coverage.
Here, the skill extracts UI hierarchies via dogtail to audit applications against accessibility standards such as WCAG, helping verify that they are usable with assistive technologies. It helps organizations in sectors like government or education comply with regulations by automatically identifying missing labels or navigational issues in GNOME or Electron apps.
This scenario enables IT support teams to remotely control and troubleshoot Linux desktops by automating tasks such as window management, input simulation, and screenshot capture. It can be used to resolve user issues in corporate environments without physical access, enhancing efficiency for helpdesk operations.
The skill automates repetitive data entry tasks by typing text and navigating forms in legacy or custom Linux applications, such as inventory systems or administrative tools. It benefits industries like logistics or retail by reducing human error and speeding up data processing workflows.
In this scenario, designers and developers use the skill to simulate user interactions and capture screenshots for visual analysis during UI prototyping. It aids in testing navigation flows and layout changes in applications like design tools or media editors, facilitating iterative design improvements.
Offer a cloud-based service that integrates this skill into a testing platform, allowing customers to automate GUI tests for Linux applications via a web interface. Revenue is generated through subscription tiers based on usage volume and support levels, targeting software companies and QA teams.
Provide consulting services to businesses needing custom automation scripts for specific Linux applications, such as legacy systems or proprietary software. Revenue comes from project-based fees and ongoing maintenance contracts, serving industries like finance or manufacturing with unique workflow requirements.
Distribute the skill as open-source while offering enterprise licenses with advanced features, priority support, and integration assistance. Revenue is generated from license sales and support packages, targeting large organizations that require reliable, scalable automation for critical operations.
💬 Integration Tip
Ensure the Linux environment has X11 running and necessary tools like xdotool installed; for Electron apps, use the --force-renderer-accessibility flag to enable UI inspection.
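A quick pre-flight check along these lines can catch a missing tool or display before any automation runs. This is a sketch: missing_prerequisites and REQUIRED_TOOLS are hypothetical names, and a set DISPLAY variable is only a rough indicator of an X11 session, since it is also set under XWayland.

```python
import os
import shutil
from typing import Callable, List, Mapping, Optional, Sequence

REQUIRED_TOOLS = ("xdotool", "wmctrl")


def missing_prerequisites(
    env: Optional[Mapping[str, str]] = None,
    which: Callable[[str], Optional[str]] = shutil.which,
    tools: Sequence[str] = REQUIRED_TOOLS,
) -> List[str]:
    """Return human-readable problems; an empty list means all checks passed.

    env and which are injectable so the check can be exercised without X11.
    """
    env = os.environ if env is None else env
    problems = []
    if not env.get("DISPLAY"):
        problems.append("DISPLAY is not set (no X11 session detected)")
    for tool in tools:
        if which(tool) is None:
            problems.append(f"{tool} not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites():
        print("missing:", problem)
```

dogtail is a Python package rather than a command-line tool, so it is not covered by this PATH check.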