Official Skills Catalog (173)

A direct enumeration of the official repo's skills directories: 72 bundled plus 101 official optional skills. Descriptions come from each skill's SKILL.md; every entry links to its official source. Not sure where to start? See the beginner recommendations. Last checked: 2026-07-04.

Bundled Skills (72)

apple

apple-notes Manage Apple Notes via memo CLI: create, search, edit.
apple-reminders Apple Reminders via remindctl: add, list, complete.
findmy Track Apple devices/AirTags via FindMy.app on macOS.
imessage Send and receive iMessages/SMS via the imsg CLI on macOS.

autonomous-ai-agents

claude-code Delegate coding to Claude Code CLI (features, PRs).
codex Delegate coding to OpenAI Codex CLI (features, PRs).
hermes-agent Configure, extend, or contribute to Hermes Agent.
opencode Delegate coding to OpenCode CLI (features, PR review).

computer-use

computer-use |

creative

architecture-diagram Dark-themed SVG architecture/cloud/infra diagrams as HTML.
ascii-art ASCII art: pyfiglet, cowsay, boxes, image-to-ascii.
ascii-video ASCII video: convert video/audio to colored ASCII MP4/GIF.
baoyu-infographic Infographics: 21 layouts x 21 styles (信息图, 可视化).
claude-design Design one-off HTML artifacts (landing, deck, prototype).
comfyui Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.
design-md Author/validate/export Google's DESIGN.md token spec files.
excalidraw Hand-drawn Excalidraw JSON diagrams (arch, flow, seq).
humanizer Humanize text: strip AI-isms and add real voice.
manim-video Manim CE animations: 3Blue1Brown math/algo videos.
p5js p5.js sketches: gen art, shaders, interactive, 3D.
popular-web-designs 54 real design systems (Stripe, Linear, Vercel) as HTML/CSS.
pretext Use when building creative browser demos with @chenglou/pretext — DOM-free text layout for ASCII art, typographic flow around obstacles, text-as-geometry games, kinetic typography, and text-powered generative art. Produces single-file HTML demos by default.
sketch Throwaway HTML mockups: 2-3 design variants to compare.
songwriting-and-ai-music Songwriting craft and Suno AI music prompts.
touchdesigner-mcp Control a running TouchDesigner instance via twozero MCP — create operators, set parameters, wire connections, execute Python, build real-time visuals. 36 native tools.

data-science

jupyter-live-kernel Iterative Python via live Jupyter kernel (hamelnb).

dogfood

dogfood Exploratory QA of web apps: find bugs, evidence, reports.

email

himalaya Himalaya CLI: IMAP/SMTP email from terminal.

github

codebase-inspection Inspect codebases w/ pygount: LOC, languages, ratios.
github-auth GitHub auth setup: HTTPS tokens, SSH keys, gh CLI login.
github-code-review Review PRs: diffs, inline comments via gh or REST.
github-issues Create, triage, label, assign GitHub issues via gh or REST.
github-pr-workflow GitHub PR lifecycle: branch, commit, open, CI, merge.
github-repo-management Clone/create/fork repos; manage remotes, releases.

media

gif-search Search/download GIFs from Tenor via curl + jq.
heartmula HeartMuLa: Suno-like song generation from lyrics + tags.
songsee Audio spectrograms/features (mel, chroma, MFCC) via CLI.
youtube-content YouTube transcripts to summaries, threads, blogs.

mlops

lm-evaluation-harness lm-eval-harness: benchmark LLMs (MMLU, GSM8K, etc.).
weights-and-biases W&B: log ML experiments, sweeps, model registry, dashboards.
huggingface-hub HuggingFace hf CLI: search/download/upload models, datasets.
llama-cpp llama.cpp local GGUF inference + HF Hub model discovery.
vllm vLLM: high-throughput LLM serving, OpenAI API, quantization.
audiocraft AudioCraft: MusicGen text-to-music, AudioGen text-to-sound.
segment-anything SAM: zero-shot image segmentation via points, boxes, masks.

note-taking

obsidian Read, search, create, and edit notes in the Obsidian vault.

productivity

airtable Airtable REST API via curl. Records CRUD, filters, upserts.
google-workspace Gmail, Calendar, Drive, Docs, Sheets via gws CLI or Python.
maps Geocode, POIs, routes, timezones via OpenStreetMap/OSRM.
nano-pdf Edit PDF text/typos/titles via nano-pdf CLI (NL prompts).
notion Notion API + ntn CLI: pages, databases, markdown, Workers.
ocr-and-documents Extract text from PDFs/scans (pymupdf, marker-pdf).
petdex Install and select animated petdex mascots for Hermes.
powerpoint Create, read, edit .pptx decks, slides, notes, templates.
teams-meeting-pipeline Operate the Teams meeting summary pipeline via Hermes CLI — summarize meetings, inspect pipeline status, replay jobs, manage Microsoft Graph subscriptions.

research

arxiv Search arXiv papers by keyword, author, category, or ID.
blogwatcher Monitor blogs and RSS/Atom feeds via blogwatcher-cli tool.
llm-wiki Karpathy's LLM Wiki: build/query interlinked markdown KB.
polymarket Query Polymarket: markets, prices, orderbooks, history.
research-paper-writing Write ML papers for NeurIPS/ICML/ICLR: design→submit.

smart-home

openhue Control Philips Hue lights, scenes, rooms via OpenHue CLI.

social-media

xurl X/Twitter via xurl CLI: post, search, DM, media, v2 API.

software-development

hermes-agent-skill-authoring Author in-repo SKILL.md: frontmatter, validator, structure, and writing-quality principles.
node-inspect-debugger Debug Node.js via --inspect + Chrome DevTools Protocol CLI.
plan Plan mode: write an actionable markdown plan to .hermes/plans/, no execution. Bite-sized tasks, exact paths, complete code.
python-debugpy Debug Python: pdb REPL + debugpy remote (DAP).
requesting-code-review Pre-commit review: security scan, quality gates, auto-fix.
simplify-code Parallel 3-agent cleanup of recent code changes.
spike Throwaway experiments to validate an idea before build.
systematic-debugging 4-phase root cause debugging: understand bugs before fixing.
test-driven-development TDD: enforce RED-GREEN-REFACTOR, tests before code.

yuanbao

yuanbao Yuanbao (元宝) groups: @mention users, query info/members.

Official Optional Skills (101)

autonomous-ai-agents

antigravity-cli Operate the Antigravity CLI (agy): plugins, auth, sandbox.
blackbox Delegate coding tasks to Blackbox AI CLI agent. Multi-model agent with built-in judge that runs tasks through multiple LLMs and picks the best result. Requires the blackbox CLI and a Blackbox AI API key.
grok Delegate coding to xAI Grok Build CLI (features, PRs).
honcho Configure and use Honcho memory with Hermes -- cross-session user modeling, multi-profile peer isolation, observation config, dialectic reasoning, session summaries, and context budget enforcement. Use when setting up Honcho, troubleshooting memory, managing profiles with Honcho peers, or tuning obs
openhands Delegate coding to OpenHands CLI (model-agnostic, LiteLLM).

blockchain

evm Read-only EVM client: wallets, tokens, gas across 8 chains.
hyperliquid Hyperliquid market data, account history, trade review.
solana Query Solana blockchain data with USD pricing — wallet balances, token portfolios with values, transaction details, NFTs, whale detection, and live network stats. Uses Solana RPC + CoinGecko. No API key required.

communication

one-three-one-rule >

creative

baoyu-article-illustrator Article illustrations: type × style × palette consistency.
baoyu-comic Knowledge comics (知识漫画): educational, biography, tutorial.
blender-mcp Control Blender directly from Hermes via socket connection to the blender-mcp addon. Create 3D objects, materials, animations, and run arbitrary Blender Python (bpy) code. Use when user wants to create or modify anything in Blender.
concept-diagrams Generate flat, minimal light/dark-aware SVG diagrams as standalone HTML files, using a unified educational visual language with 9 semantic color ramps, sentence-case typography, and automatic dark mode. Best suited for educational and non-software visuals — physics setups, chemistry mechanisms, math
creative-ideation Generate ideas via named methods from creative practice.
hyperframes Create HTML-based video compositions, animated title cards, social overlays, captioned talking-head videos, audio-reactive visuals, and shader transitions using HyperFrames. HTML is the source of truth for video. Use when the user wants a rendered MP4/WebM from an HTML composition, wants to animate
kanban-video-orchestrator Plan, set up, and monitor a multi-agent video production pipeline backed by Hermes Kanban. Use when the user wants to make ANY video — narrative film, product/marketing, music video, explainer, ASCII/terminal art, abstract/generative loop, comic, 3D, real-time/installation — and the work warrants de
meme-generation Generate real meme images by picking a template and overlaying text with Pillow. Produces actual .png meme files.
pixel-art Pixel art w/ era palettes (NES, Game Boy, PICO-8).

devops

cli Run 150+ AI apps via inference.sh CLI (infsh) — image generation, video creation, LLMs, search, 3D, social automation. Uses the terminal tool. Triggers: inference.sh, infsh, ai apps, flux, veo, image generation, video generation, seedream, seedance, tavily
docker-management Manage Docker containers, images, volumes, networks, and Compose stacks — lifecycle ops, debugging, cleanup, and Dockerfile optimization.
hermes-s6-container-supervision Modify, debug, or extend the s6-overlay supervision tree inside the Hermes Agent Docker image — adding new services, debugging profile gateways, understanding the Architecture B main-program pattern.
pinggy-tunnel Zero-install localhost tunnels over SSH via Pinggy.
watchers Poll RSS, JSON APIs, and GitHub with watermark dedup.

dogfood

adversarial-ux-test Roleplay the most difficult, tech-resistant user for your product. Browse the app as that persona, find every UX pain point, then filter complaints through a pragmatism layer to separate real problems from noise. Creates actionable tickets from genuine issues only.

email

agentmail Give the agent its own dedicated email inbox via AgentMail. Send, receive, and manage email autonomously using agent-owned email addresses (e.g. [email protected]).

finance

3-statement-model Build fully-integrated 3-statement models (IS, BS, CF) in Excel with working capital schedules, D&A roll-forwards, debt schedule, and the plugs that make cash and retained earnings tie. Pairs with excel-author.
comps-analysis Build comparable company analysis in Excel — operating metrics, valuation multiples, statistical benchmarking vs peer sets. Pairs with excel-author. Use for public-company valuation, IPO pricing, sector benchmarking, or outlier detection.
dcf-model Build institutional-quality DCF valuation models in Excel — revenue projections, FCF build, WACC, terminal value, Bear/Base/Bull scenarios, 5x5 sensitivity tables. Pairs with excel-author. Use for intrinsic-value equity analysis.
excel-author Build auditable Excel workbooks headless with openpyxl — blue/black/green cell conventions, formulas over hardcodes, named ranges, balance checks, sensitivity tables. Use for financial models, audit outputs, reconciliations.
lbo-model Build leveraged buyout models in Excel — sources & uses, debt schedule, cash sweep, exit multiple, IRR/MOIC sensitivity. Pairs with excel-author. Use for PE screening, sponsor-case valuation, or illustrative LBO in a pitch.
merger-model Build accretion/dilution (merger) models in Excel — pro-forma P&L, synergies, financing mix, EPS impact. Pairs with excel-author. Use for M&A pitches, board materials, or deal evaluation.
pptx-author Build PowerPoint decks headless with python-pptx. Pairs with excel-author for model-backed decks where every number traces to a workbook cell. Use for pitch decks, IC memos, earnings notes.
stocks Stock quotes, history, search, compare, crypto via Yahoo.

gaming

minecraft-modpack-server Host modded Minecraft servers (CurseForge, Modrinth).
pokemon-player Play Pokemon via headless emulator + RAM reads.

health

mcp

fastmcp Build, test, inspect, install, and deploy MCP servers with FastMCP in Python. Use when creating a new MCP server, wrapping an API or database as MCP tools, exposing resources or prompts, or preparing a FastMCP server for Claude Code, Cursor, or HTTP deployment.
mcporter Use the mcporter CLI to list, configure, auth, and call MCP servers/tools directly (HTTP or stdio), including ad-hoc servers, config edits, and CLI/type generation.

migration

openclaw-migration Migrate a user's OpenClaw customization footprint into Hermes Agent. Imports Hermes-compatible memories, SOUL.md, command allowlists, user skills, and selected workspace assets from ~/.openclaw, then reports exactly what could not be migrated and why.

mlops

accelerate Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/FSDP/Megatron/DDP. Automatic device placement, mixed precision (FP16/BF16/FP8). Interactive config, single launch command. HuggingFace ecosystem standard.
chroma Open-source embedding database for AI applications. Store embeddings and metadata, perform vector and full-text search, filter by metadata. Simple 4-function API. Scales from notebooks to production clusters. Use for semantic search, RAG applications, or document retrieval. Best for local developmen
clip OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning. Best for general-purpose image understandin
faiss Facebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU acceleration, and various index types (Flat, IVF, HNSW). Use for fast k-NN search, large-scale vector retrieval, or when you need pure similarity search without metadata. Best for hi
flash-attention Optimizes transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Use when training/running transformers with long sequences (>512 tokens), encountering GPU memory issues with attention, or need faster inference. Supports PyTorch native SDPA, flash-attn library, H100
guidance Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance - Microsoft Research's constrained generation framework
huggingface-tokenizers Fast tokenizers optimized for research and production. Rust-based implementation tokenizes 1GB in <20 seconds. Supports BPE, WordPiece, and Unigram algorithms. Train custom vocabularies, track alignments, handle padding/truncation. Integrates seamlessly with transformers. Use when you need high-perf
outlines Outlines: structured JSON/regex/Pydantic LLM generation.
instructor Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, parse complex JSON with type safety, and stream partial results with Instructor - battle-tested structured output library
lambda-labs Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node clusters for large-scale training.
llava Large Language and Vision Assistant. Enables visual instruction tuning and image-based conversations. Combines CLIP vision encoder with Vicuna/LLaMA language models. Supports multi-turn image chat, visual question answering, and instruction following. Use for vision-language chatbots or image unders
modal Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.
nemo-curator GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cle
obliteratus OBLITERATUS: abliterate LLM refusals (diff-in-means).
peft Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transform
pinecone Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse), metadata filtering, and namespaces. Low latency (<100ms p95). Use for production RAG, recommendation systems, or semantic search at scale. Best for serverless, managed infrastruc
pytorch-fsdp Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP - parameter sharding, mixed precision, CPU offloading, FSDP2
pytorch-lightning High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from laptop to supercomputer with same code. Use when you want clean training loops with built-in best practices.
qdrant High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.
dspy DSPy: declarative LM programs, auto-optimize prompts, RAG.
saelens Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.
simpo Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model needed, more efficient than DPO. Use for preference alignment when want simpler, faster training than DPO/PPO.
slime Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM integration for RL scaling.
stable-diffusion State-of-the-art text-to-image generation with Stable Diffusion models via HuggingFace Diffusers. Use when generating images from text prompts, performing image-to-image translation, inpainting, or building custom diffusion pipelines.
tensorrt-llm Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling.
torchtitan Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or custom models at scale from 8 to 512+ GPUs with Float8, torch.compile, and distributed checkpointing.
axolotl Axolotl: YAML LLM fine-tuning (LoRA, DPO, GRPO).
trl-fine-tuning TRL: SFT, DPO, PPO, GRPO, reward modeling for LLM RLHF.
unsloth Unsloth: 2-5x faster LoRA/QLoRA fine-tuning, less VRAM.
whisper OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robu

payments

mpp-agent Pay HTTP 402 APIs via Machine Payments Protocol (MPP).
stripe-link-cli Agent payments via Stripe Link — cards, SPT, approvals.
stripe-projects Provision SaaS services + sync creds via Stripe Projects.

productivity

canvas Canvas LMS integration — fetch enrolled courses and assignments using API token authentication.
here-now Publish static sites to {slug}.here.now and store private files in cloud Drives for agent-to-agent handoff.
memento-flashcards >-
shop Shop catalog search, checkout, order tracking, returns.
shopify Shopify Admin & Storefront GraphQL APIs via curl. Products, orders, customers, inventory, metafields.
siyuan SiYuan Note API for searching, reading, creating, and managing blocks and documents in a self-hosted knowledge base via curl.
telephony Give Hermes phone capabilities without core tool changes. Provision and persist a Twilio number, send and receive SMS/MMS, make direct calls, and place AI-driven outbound calls through Bland.ai or Vapi.

research

bioinformatics Gateway to 400+ bioinformatics skills from bioSkills and ClawBio. Covers genomics, transcriptomics, single-cell, variant calling, pharmacogenomics, metagenomics, structural biology, and more. Fetches domain-specific reference material on demand.
darwinian-evolver Evolve prompts/regex/SQL/code with Imbue's evolution loop.
domain-intel Passive domain reconnaissance using Python stdlib. Subdomain discovery, SSL certificate inspection, WHOIS lookups, DNS records, domain availability checks, and bulk multi-domain analysis. No API keys required.
drug-discovery >
duckduckgo-search Free web search via DuckDuckGo — text, news, images, videos. No API key needed. Prefer the `ddgs` CLI when installed; use the Python DDGS library only after verifying that `ddgs` is available in the current runtime.
gitnexus-explorer Index a codebase with GitNexus and serve an interactive knowledge graph via web UI + Cloudflare tunnel.
osint-investigation Public-records OSINT investigation framework — SEC EDGAR filings, USAspending contracts, Senate lobbying, OFAC sanctions, ICIJ offshore leaks, NYC property records (ACRIS), OpenCorporates registries, CourtListener court records, Wayback Machine archives, Wikipedia + Wikidata, GDELT news monitoring.
parallel-cli Optional vendor skill for Parallel CLI — agent-native web search, extraction, deep research, enrichment, FindAll, and monitoring. Prefer JSON output and non-interactive flows.
qmd Search personal knowledge bases, notes, docs, and meeting transcripts locally using qmd — a hybrid retrieval engine with BM25, vector search, and LLM reranking. Supports CLI and MCP integration.
scrapling Web scraping with Scrapling - HTTP fetching, stealth browser automation, Cloudflare bypass, and spider crawling via CLI and Python.
searxng-search Free meta-search via SearXNG — aggregates results from 70+ search engines. Self-hosted or use a public instance. No API key needed. Falls back automatically when the web search toolset is unavailable.

security

1password Set up and use 1Password CLI (op). Use when installing the CLI, enabling desktop app integration, signing in, and reading/injecting secrets for commands.
godmode Jailbreak LLMs: Parseltongue, GODMODE, ULTRAPLINIAN.
oss-forensics |
sherlock OSINT username search across 400+ social networks. Hunt down social media accounts by username.
unbroker Autonomously remove your info from data-broker sites.
web-pentest |

software-development

code-wiki Generate wiki docs + Mermaid diagrams for any codebase.
rest-graphql-debug Debug REST/GraphQL APIs: status codes, auth, schemas, repro.
subagent-driven-development Execute plans via delegate_task subagents (2-stage review).

web-development

cloudflare-temporary-deploy Deploy a Worker live, no account, via wrangler --temporary.