Skip to main content

Hermes with General Agents: Aik 90-Minute Crash Course

Chhe Scenarios Plus Aik Voice Bonus, Zero se aik aise AI Employee tak jo Aap ko Seekhta Hai

Hermes aap ka self-improving AI Employee hai: Nous Research ka aik open-source agent jo aap ki apni infrastructure par chalta hai (aaj aap ka laptop, kal aik sasta always-on computer) aur un hi messaging apps ke zariye aap tak pohanchta hai jo aap pehle se use karte hain.

Yeh woh aik cheez hai jo OpenClaw banaya hi nahin gaya: aik aisa agent jo aap ke kaam par utna hi behtar hota hai jitna lamba woh aap ke liye kaam karta hai. Woh mushkil tasks se apni skills khud likhta hai, har dafa dobara use karne par un skills ko tez karta hai, pichle sessions mein kya hua woh yaad karta hai, aur aap kaun hain is ka aik gehra hota model banata hai. Jahan OpenClaw ne breadth par daao lagaya (har channel par aap tak pohancho), Hermes depth par daao lagata hai (aap ko seekho, aur compound karo).

In navvey minute ke aakhir tak aap ke paas aik hoga: aik AI Employee jo aap ke phone se jawab deta hai, jis ne aap ke saamne aap ke aik real task ko aik reusable skill mein badal diya, jo aik aisi haqeeqat yaad rakhta hai jo aap ne usay aik alag session mein sikhayi thi bina batay, aur jo aap ke sotay waqt aik scheduled job chalata hai. Aisa chatbot nahin jisko aap har subah apne aap ko dobara samjhate hain; aik aisa worker jo jama hota jaata hai.

Agar aap ne OpenClaw crash course kiya tha

Yeh us se jaan boojh kar milta-julta hai: wahi general-agent-as-installer pattern, wahi paanch-step rhythm, wahi "aap steer karte hain, agent kaam karta hai" contract. Jo farq hai woh payoff hai. OpenClaw ne saabit kiya ke aik AI Employee real hai. Hermes saabit karta hai ke woh compound kar sakta hai. OpenClaw yahan aik soft prerequisite hai, hard nahin: agar aap ne woh course kiya, yeh seedha us par build karta hai, aur Scenario 1 aap ke setup ko aik step mein import kar leta hai; agar aap ne nahin kiya, aap phir bhi har scenario follow kar sakte hain, bas yeh jaan lein ke poore course mein jo OpenClaw contrasts aate hain woh us ki taraf ishaara kar rahe hain.

Aap ko in ki zaroorat hogi (pehle line up karne mein takreeban 10 minute)
  • Aik general agent installed: Claude Code ya OpenCode. Dono mein se kisi se naye hain? Pehle Agentic Coding Crash Course karein. Yeh aik hi hard prerequisite hai.
  • Git: woh aik cheez jo aap khud install karte hain; baqi sab installer sambhalta hai.
  • Aik phone messaging app: Telegram sab se aasan hai (Discord ya Signal bhi chalte hain), Scenario 2 ke liye.
  • Waqt: takreeban 90 minute sirf tab agar har prerequisite pehle se jagah par ho aur kisi cheez ko doosri koshish ki zaroorat na pare. Aik realistic pehla run do ghantay ke kareeb pohanchta hai: browser login, free model key, aik skill jisay save hone ke liye aik retry chahiye, aur apna messaging channel set up karna har aik dikhne se zyada waqt leta hai. Do ghantay ka budget rakhein aur jaldi khatam hone ko aik bonus samjhein.

Is chapter mein commands aur behavior official Hermes Agent docs (CLI reference, Skills System, Sessions, Quickstart, Installation) ke khilaf June 2026 tak verify kiye gaye. Hermes tez chalta hai, to agar koi flag drift ho gaya hai, hermes --help, hermes <command> --help, aur official docs hi source of truth hain.


Yeh crash course kaise kaam karta hai. Aap aik chhota sa folder download karte hain, usay apne general agent ko dete hain (Claude Code, OpenCode, Cowork, ya OpenCowork sab chalte hain, har aik folder context se AGENTS.md auto-import karta hai), aur chhe core scenarios plus aik voice bonus mein se guzarte hain. Agent folder parhta hai, Hermes install aur run karta hai, aik model provision karta hai, aap ka phone connect karta hai, aur phir woh cheez karta hai jo sirf Hermes karta hai: aik mushkil task ko aik skill mein badalta hai, aur aap ko us deewar ke paar yaad rakhta hai jo doosre agents ko rok deti hai. Hermes woh AI Employee ban jaata hai jo aap ke saath barhta hai.

Mujhe kaun sa agent use karna chahiye?

Neeche diye scenarios agent-agnostic hain: har "yeh apne agent ko paste karein" prompt tools ke across aik jaisa hai. Sirf farq launch step ka hai. CLI agents (Claude Code, OpenCode) unzipped folder mein aik terminal se launch hote hain; desktop agents (Cowork, OpenCowork) folder ko app mein khol kar launch hote hain. Jo bhi aap ke paas pehle se installed hai woh pick karein. Zip mein mojood brief chaaron ke liye aik jaisa kaam karta hai. Aik nuance: skill-install command do CLI tools (Claude Code aur OpenCode) ko target karta hai; Cowork aur OpenCowork (desktop) seedha brief par bharosa karte hain aur skill detail live docs se fetch karte hain.

Alfaaz jo aap dekhenge (yeh khole agar yahan koi term naya hai)

Plain-language definitions. Aap in mein se kuch bhi type nahin karenge (aap ka agent karta hai), lekin in alfaaz ko pehchanna madad karta hai:

  • AI Employee: woh Hermes agent jo aap set up kar rahe hain. Woh aap ke liye kaam karta hai, aap ko yaad rakhta hai, aur waqt ke saath behtar hota hai.
  • General agent: woh coding agent jo aap ke paas pehle se hai (Claude Code ya OpenCode). Woh install aur configure karta hai. Isay us contractor ki tarah samjhein jo aap ke naye employee ko set up karta hai.
  • API key: aik secret string jo Hermes ko aik model use karne deti hai. Aap apne browser mein aik free (no card) banate hain aur usay apni machine par aik file mein paste karte hain. Yeh setup ka aik hissa hai jo aap ka hai, agent ka nahin.
  • TUI (terminal user interface): terminal ke andar aik keyboard-driven chat window. Terminal use na karna chahte hain? Iss ke bajaye desktop app use karein (Scenario 1 dikhata hai kaise).
  • Gateway: woh hissa jo aap ke agent ko messaging apps (Telegram waghaira) se connect karta hai taake aap usay apne phone se reach kar sakein.
  • Skill: aik chhota sa note-to-self jo agent likhta hai, batata hai us ne aik task kaise kiya, taake agli dafa woh dobara samajhne ke bajaye us note ko follow kare.
  • Memory: woh files jo agent aap ke baare mein aur aap ke kaam ke baare mein rakhta hai, taake woh har session blank slate se start na kare.
  • Cron / scheduled job: aik task jo ghari par chalta hai ("har weekday ko subah 8 baje") bina aap ke har dafa kahay.
  • The seam: koi bhi aisa step jo sirf aik insaan kar sakta hai, jaise browser login ya aik token paste karna. Aap ka agent wahan ruk kar intezaar karta hai.
  • ~/.hermes/: aap ki machine par woh aik folder jahan Hermes upar di gayi har cheez rakhta hai. Yeh aap ka hai, aur aap isay back up kar sakte hain.
Reading path

Reading path (chhe core scenarios, aik voice bonus, plus aik monthly habit):

  1. Install & chat terminal UI mein (ya OpenClaw se migrate). ~15 min.
  2. Apne phone se reach karein gateway ke zariye, aur seekhein woh kahan rehna chahta hai. ~15 min.
  3. Aik mushkil task dein aur dekhein woh apni skill khud likhta hai. ~15 min.
  4. Saabit karein woh aap ko yaad rakhta hai aik fresh session ke across, bina manual commit. ~15 min.
  5. Skill dobara use karein, model swap karein taake no lock-in saabit ho. ~15 min.
  6. Usay khud act karwayein aik natural-language cron job se, phir brain back up karein. ~15 min.
  7. (Bonus) Usay aik voice dein taake aap ka Telegram bot bole gaye audio se jawab de, free. ~10 min.
  8. (Mahine mein aik dafa, aaj nahin) Skills & memory audit chalayein. Waqt aaye to ~10 min.

Har scenario aik chalne-wali success par khatam hota hai. State in ke darmiyan rehti hai, to aap unhein alag-alag baithakon mein baant sakte hain.

Patient version chahiye?

Yeh crash course tez raasta hai. Ussi material ka aaram se, lesson-by-lesson treatment (learning loop internals, memory providers, remote backends, multi-agent delegation, aur production deployment) Hermes deep chapter mein hai. Agar yahan kuch bhi bohot tez lage, matching lesson par chale jaayein aur waapis aa jaayein.


📚 Teaching Aid

Open Full Slideshow

View Full Presentation: Hermes with General Agents


Collaboration pattern

Teen actors yeh page share karte hain, bilkul OpenClaw course ki tarah, lekin teesre actor ka center of gravity alag hai.

Collaboration triangle: aap steer aur approve karte hain; aap ka general agent install aur configure karta hai; Hermes aap ke phone par jawab deta hai, apni skills khud likhta hai, aur aap ko yaad rakhta hai.

Har scenario wahi paanch-step rhythm use karta hai jo aap pehle se jaante hain:

  1. Aap aik jumla paste karte hain apne general agent mein. Aik brief, script nahin: aap bayan karte hain aap kya chahte hain; aap steps ginwate nahin.
  2. Aap ka agent AGENTS.md se mashwara karta hai (jo pehle se us ke context mein hai) aur aik plan tajweez karta hai. Woh un commands ka naam leta hai jo woh chalane ka iraada rakhta hai aur decision points flag karta hai (kaun sa provider, kaun sa channel, kaun sa task). Woh pehli destructive command se pehle poochta hai.
  3. Aap approve karte hain aur dekhte hain. Agent install commands chalata hai, config edit karta hai, gateway restart karta hai, live log tail karta hai, aur aap ko dikhata hai woh kya dekh raha hai. Kisi maaloom gotcha par woh pattern pehchanta hai aur documented fix lagata hai.
  4. Aap ka agent seam par ruk jaata hai. Kuch moves sirf aap kar sakte hain: apni free model key browser mein banana, aik Telegram bot token paste karna, aik scheduled job approve karna. Agent seam ka naam leta hai aur intezaar karta hai.
  5. Aap tab done hote hain jab aik observable cheez ho jaaye. TUI mein aik jawab. Aap ke phone se aik message ka jawab milta hai. Aik naya skill file jo agent ne khud likha disk par numoodaar hota hai. Har scenario aap ko batata hai kya dekhna hai.

Har scenario jo paanch-step rhythm use karta hai, aap ke aik-jumlay brief se le kar us done-when tak jo aap verify karte hain.

Un commands par aik baat jo aap dekhenge: is course mein chhapa har hermes … command woh hai jo aap ka agent chalata hai, dikhaya gaya taake aap saath follow kar sakein, aisa kuch nahin jo aap type karein.

Poore crash course ke liye aik recovery move

Agar kisi bhi maqaam par kuch ulta ho jaaye, aap ko CLI flags ya error codes jaanne ki zaroorat nahin. Yeh apne agent ko paste karein:

Kuch kaam nahin kiya. hermes doctor chalao, gateway log parho, mujhe plain language mein batao tum kya dekhte ho, aur aik fix tajweez karo jo mein approve kar sakoon.

Aap ka agent diagnose karta hai, jo dekhta hai us ka naam leta hai, aur fix tajweez karta hai. Aap approve karte hain. Yahan har scenario ke liye yahi recovery loop hai.

Jo folder aap download karenge us mein kya hai

Zip mein theek do files hain, aur jaan boojh kar woh chhoti hain. AGENTS.md aik chhota sa brief hai jo pehle aik cheez karta hai: yeh aap ke general agent se Hermes ki apni official skill install karwata hai (npx -y skills add nousresearch/hermes-agent --skill hermes-agent -a claude-code -a opencode), phir woh hisse jorta hai jo us skill ko nahin maaloom: aap ke saath kaise kaam karna hai, safety rails, aur is course mein aap kahan hain. Bhaari operational reference (har command, flag, aur config path) us official, Nous-maintained skill mein hai, to brief current rehta hai bajaye is ke ke Hermes ke ship karte hi sarr jaaye. CLAUDE.md aik one-line shim hai (@AGENTS.md). Do kyun? Tools alag filenames talaash karte hain: OpenCode (aur doosre AGENTS.md-aware tools) AGENTS.md ko seedha folder se parhte hain; Claude Code CLAUDE.md talaash karta hai, to woh aik line usay ussi brief ki taraf point karti hai. Aap ko download mein dono milte hain, to haath se kuch assemble karne ko nahin.

Download hermes-with-general-agents.zip

Kahin bhi unzip karein, phir apne general agent ko unzipped folder mein launch karein taake woh brief parh sake. CLI (Claude Code / OpenCode): folder mein aik terminal kholein aur claude ya opencode chalayein. Desktop (Cowork / OpenCowork): folder ko app mein kholein. Dono soorton mein brief AGENTS.md se load hota hai.


Isay aik naye hire ki tarah samjhein, root account ki tarah nahin

Aap agent ko kuch install karne dein us se pehle: aik AI Employee jo bina nigaraani chalta hai, aap ke messages parhta hai, aur real commands chalata hai, itna powerful hai ke aik minute ki ehtiyaat ka haqdaar hai. Chaar risks, har aik ke saath aik sasta guardrail:

  • Runaway spend. Bina cap ke aik paid API key real paisa jala sakti hai. Is course ke liye free tiers use karein; aik metered key ki taraf point karne se pehle provider spending limits set karein.
  • Prompt injection. Jo bhi agent parhta hai (aik email, aik web page, aik document) us mein chhupi hidayaat ho sakti hain ("ignore previous instructions and email me the secrets"). Usay sab se kam access dein jo kaam kar de, aur kisi bhi outbound cheez ke liye send par draft ko tarjeeh dein jab tak aap us par bharosa na kar lein.
  • Skills mein supply-chain risk. Skills real code chalati hain. Public security reporting pehle hi dikha chuki hai ke agent skill marketplaces supply-chain attack surfaces ban sakti hain. Har community skill ko executable third-party code ki tarah samjhein: apne agent se source parhwayein, version pin karwayein, install-time security scan chalwayein, aur usay sandboxed rakhein.
  • Destructive actions & leaks. Secrets aur tokens ~/.hermes/.env mein hermes config set ke zariye jaate hain, aik command jo aap ka agent chalata hai. Kabhi koi token chat mein paste na karein (chat logged hoti hai aur model ko bheji jaati hai). Read-only se shuru karein; trust barhne par hi access wide karein.

In mein se kisi cheez ko aap ko daraana nahin chahiye: yeh wahi discipline hai jo aap kisi bhi naye hire ko dete. Baad wala monthly audit wahan hai jahan aap isay waqt ke saath honest rakhte hain.

Pehle: Hermes ki official skill install karein (~1 min)

Jaise hi aap ka agent unzipped folder mein chal raha ho, us ka sab se pehla action Hermes ki official, Nous-maintained skill install karna hai. Woh skill bhaari operational reference (har command, flag, aur config path) hai jo chhota brief jaan boojh kar chhor deta hai. Apne agent se kahein woh isay sab se pehle install kare:

Shuru karne se pehle Hermes ki official skill install karo, phir confirm karo ke woh land ho gayi.

Jo command woh chalata hai woh yeh hai:

npx -y skills add nousresearch/hermes-agent --skill hermes-agent -a claude-code -a opencode

Woh skill ko .agents/skills/hermes-agent/ mein land karta hai, jo Claude Code aur OpenCode (do CLI tools jinhein -a flags target karte hain) share karte hain. Note karein yeh aik alag store hai ~/.hermes/skills/ se, jahan Hermes baad mein woh skills likhta hai jo woh khud ko sikhata hai: installed reference aik jagah, self-written procedural memory doosri mein. Installer aik chhota security assessment aur aik ✓ Installed 1 skill line print karta hai, aur woh line aap ki confirmation hai ke woh land ho gayi. Sirf saaf exit par bharosa na karein: npx skills add aise naam ko khamoshi se skip kar deta hai jisay woh resolve nahin kar sakta aur phir bhi 0 exit karta hai, to apne agent se woh ✓ Installed 1 skill line parhwa lein (ya check karein ke .agents/skills/hermes-agent/ ab mojood hai).

Agar aap ka tool sirf launch par skills load karta hai, install approve karein, phir folder mein aik dafa relaunch karein aur Scenario 1 shuru karne se pehle neeche wala brief-check dobara chalayein.

Scenario 1 se pehle: confirm karein ke aap ke agent ne brief load kar liya (~30 sec)

Aik paste aap ko bata deta hai ke brief load hua ya nahin, yaani ke aap ke agent ne AGENTS.md pick kiya ya nahin:

Tum Hermes ke liye kya kar sakte ho?

Agar jawab pehle Hermes ki official skill install karne, phir aap ko scenarios (install, phone, learning loop, memory, model-swap, automation) mein plain language mein le jaane ka zikr karta hai, to aap loaded hain. Agar woh generic AI-capability baat lagti hai, to brief fire nahin hua: agent band karein, confirm karein woh unzipped folder ki taraf point hai (wahan aik terminal khula, ya folder app mein khula), aur relaunch karein.


Scenario 1: Employee ko installed aur chatting karwayein (~15 min)

Maqsad: Hermes chal raha ho, aik free model wired up ho (no card), aur terminal UI mein aik real jawab waapis aa raha ho.

Do on-ramps hain. Agar aap ne OpenClaw crash course mukammal kiya, migration path lein: yeh aap ki settings, memories, skills, aur keys ko aik step mein le aata hai. Agar aap fresh shuru kar rahe hain, aap ka agent aik free Google AI Studio (Gemini) key set up karta hai: no credit card, no paid subscription, aur aap ka aik hi hands-on step browser mein key banana hai.

1a. Install aur set up

Pehla prompt: bayan karein aap kya chahte hain aur plan maangein.

Mein Hermes ko chalwa kar jawab dilwana chahta hoon, aik free model use karte hue taake mujhe pay ya koi pechida cheez set up na karni pare. Kuch chhone se pehle, mujhe apna plan plain language mein samjhao: kya tum pehle check karoge, kya install karoge, aur mujhe kahan step in karna hoga.

Aap ka agent contract ke liye AGENTS.md parhta hai (aap ke saath kaise kaam karna hai, safety rails, course mein aap kahan hain) aur exact Hermes commands us official skill se pull karta hai jo aap ne setup step mein install ki. Woh aap ki machine dekhta hai aur aik plan tajweez karta hai. Skill us se official installer chalwati hai (installer apne tools khud le aata hai, to aap kuch pre-install nahin karte). Phir, kisi interactive wizard ko drive karne ke bajaye, woh Hermes ko aap ke liye aik free model ki taraf point karta hai aik-do non-interactive settings ke saath: woh Google AI Studio (Gemini) ko provider chunta hai aur aik capable free model pick karta hai. Aik cheez jo woh aap ke liye nahin kar sakta woh key khud hai.

Agar terminal aap ki duniya nahin

Hermes macOS, Windows, aur Linux ke liye aik native desktop app bhi ship karta hai: one-click install, aik chat window, aik skills manager, aik cron panel, drag-and-drop files, aik inline model picker, aur side-by-side profiles, sab bina terminal ke. Is course ki har cheez wahan bilkul aise hi chalti hai; sirf aap ka launch surface badalta hai, kyunke agent Hermes ko neeche se ussi tarah drive karta hai. Apne agent ko batayein ke aap desktop app tarjeeh dete hain aur woh aap ko installer ki taraf point kar dega. (Aik safety note jo har install method par lagti hai: sirf official Nous Research site se download karein. Fake builds chalti rehti hain.)

Doosra prompt: approve karein aur chalne dein.

Plan acha lagta hai. Step by step aage barho aur har step par mujhe batao tum kya dekhte ho. Jab usay meri free Gemini key chahiye ho, ruk jao aur theek theek batao mujhe kya karna hai.

Agent Hermes install karta hai aur aap ke liye free Gemini provider configure karta hai, non-interactively (aap ke drive karne ke liye koi wizard nahin). Phir woh ruk jaata hai, kyunke aik cheez jo woh aap ke liye nahin kar sakta woh key banana hai. Yahan poora flow hai, taake aap jaanein kya aap ka hai aur kya agent ka.

Aap ka aik hands-on step: https://aistudio.google.com/apikey kholein, apne Google account se sign in karein (no credit card), aur aik free key banayein. Us key ko Hermes ki secrets file mein khud paste karein, apne hi terminal mein, aik line se:

printf 'GEMINI_API_KEY=%s\n' 'your-key-here' >> ~/.hermes/.env

Key ko file mein daalein, kabhi chat mein nahin (chat logged hoti hai aur model ko bheji jaati hai). Phir agent ko batayein key jagah par hai; woh verify karta hai, aur aap ko aik real jawab milta hai. Agent har command chalata hai. Free key banana aur usay us aik file mein paste karna aap ka aik hi hands-on step hai. Aur agar koi key kabhi ghalti se chat mein chali jaaye, free key ke saath koi nuqsaan nahin: agent aap ko chalwa dega, phir aap se aik fresh key banwa kar swap karwa dega, takreeban aik minute ka kaam.

Agar Hermes open source hai to login bhi kyun?

Hermes aap ka hai aur aap ki machine par chalta hai, lekin us ka apna koi dimaagh nahin: woh aap ke messages aik LLM ko bhejta hai jo kisi aur ke servers par chalta hai. Un models mein se aik use karne ke liye aap ko aik key chahiye, aur Gemini ki free hai.

OpenClaw se aa rahe hain? Iss ke bajaye migration fork lein

Pehle prompt ko is se replace karein:

Mein ne abhi OpenClaw crash course mukammal kiya aur OpenClaw abhi bhi installed hai. Hermes install karo, phir mera OpenClaw setup migrate karo. Pehle aik dry run karo taake mein theek theek dekh sakoon kya move hoga (settings, memories, skills, keys) kuch likhe jaane se pehle, phir mere approve karne ke baad asal migration karo.

Under the hood agent hermes claw migrate --dry-run chalata hai (setup wizard ~/.openclaw ko khud bhi auto-detect karta hai aur yeh offer karta hai), aap ko diff dikhata hai, aur aap ki approval par asal migration chalata hai. Aap ke OpenClaw AI Employee ki identity aur memories Hermes mein saheeh-salaamat pohanchti hain, ab aik aise learning loop ke upar baithi jo OpenClaw ke paas nahin.

1a done jab: agent report kare ke Hermes installed, aik model configured, aur aap ki free Gemini key jagah par hai.

1b. End-to-end verify karein aur terminal UI kholein

Teesra prompt: verify karein, phir TUI ko hand off karein.

hermes doctor chalao aur mujhe batao ke yeh green hai. Phir modern terminal UI launch karo aur mujhe type karne ke liye aik pehla task do jo saabit kare ke model aur aik tool dono kaam kar rahe hain: kuch specific aur asaani se check hone wala, "say hi" nahin.

Aap ka agent health check chalata hai, phir modern terminal UI launch karta hai. Aap aik banner dekhenge jis mein aap ka model, available tools, aur skills hongi. Woh verification task type karein jo aap ka agent tajweez karta hai: kuch aisa jaise "Is folder ko check karo aur batao main project file kaun si hai", jo aik built-in tool se waqai kuch karwata hai jo aap check kar sakte hain, training data se aik andaaza nahin.

"Green" ka matlab auth line hai, zero warnings nahin

hermes doctor aksar aik bilkul healthy setup par bhi kuch yellow warnings print karta hai, aur unhein ignore karna safe hai. Do normal hain: "config version outdated" (aik cosmetic v0 se v30 note) aur "optional providers (Telegram, Discord, waghaira) not installed" (expected, kyunke aap ne unhein abhi add nahin kiya). Jis line ko waqai green hona hai woh Gemini ke liye model aur auth line hai. To jab yeh course kehta hai "doctor green hai," isay "model aur auth line green hai" parhein, "zero warnings hain" nahin.

Aap Scenario 1 ke saath tab done hain jab: hermes doctor green ho AUR TUI mein aik specific task aik real, sahi jawab ke saath waapis aaye (aik tool waqai fire hua, training data se andaaza nahin).

Under the hood jhaank: Hermes kahan rehta hai (aap yeh kabhi type nahin karte)

Har cheez ~/.hermes/ ke neeche hai: aik folder jo aap ka hai. Is course ke liye jo teen cheezein matter karti hain woh hain woh skills jo woh khud ko sikhata hai, aap ki us ki memory, aur us ke logs. Aap poore folder ko aik step mein back up kar sakte hain (Scenario 6).

Jab recovery prompt kehta hai "gateway log parho," woh us folder mein aik file hai. Jab Scenario 3 kehta hai "aik skill numoodaar hui," woh wahan save hui aik nayi skill hai. Jab Scenario 4 kehta hai "us ne yaad rakha," woh aap ki us ki memory hai, plus pichle sessions ki aik searchable history.

Mujhe kaun sa model pick karna chahiye?

Aap isay $0 par aik free Gemini key ke saath chala sakte hain (course default). Aap ka agent aap ke liye aik capable model pick karta hai, aur aap usay baad mein swap kar sakte hain: yahi woh poora no-lock-in point hai jo aap Scenario 5 mein saabit karenge. Default accept karein jab tak aap ke paas koi wajah na ho.

Setup modes, aur is course ke liye jis se bachna hai

Aap ka agent sensible defaults pick karta hai, to aap ko haath se aik setup mode chunna nahin parta. Is course ke liye jo aik cheez matter karti hai: Blank Slate pick na karein. Yeh memory capture off kar deta hai, to learning-loop scenarios (3, 4, aur 5) fire nahin honge. Blank Slate apni jagah baad mein kamata hai, client-facing ya production agents ke liye jahan aik chhoti surface aik feature hai, limitation nahin. Seekhne ke liye aik fully-loaded profile chalayein; jab aap ship karein to Blank Slate ki taraf jaayein.


Scenario 2: Apne phone se reach karein, aur seekhein woh kahan rehna chahta hai (~15 min)

Maqsad: apne phone se aik message bhejein aur aik jawab paayein, aur samjhein Hermes aap ke laptop ko chalne ki sab se kam dilchasp jagah kyun samajhta hai.

OpenClaw design ke lehaaz se aap ke laptop par rehta hai. Hermes bilkul ulta bana hai: kahin bhi chalta hai, jahan aap ho wahan rehta hai. Gateway aik agent hai, aik memory, 20+ platforms se reach karne layak. Aaj aap aik channel locally pair karenge. Asal manzil (official skill aur deep chapter mein covered) aik sasta always-on computer hai, taake aap ka AI Employee apni memory rakhe aur aap ke phone ka jawab de chahe aap ka laptop khula ho ya na.

Aik agent, aik shared memory, Telegram, WhatsApp, Slack, Signal, Discord, ya Email se reach karne layak, aur aap ke laptop, aik $5 VPS, ya aik serverless sandbox par chalne layak jo demand par jaagti hai.

Yeh apne agent ko paste karein:

Mein Hermes se apne phone se baat karna chahta hoon. Messaging gateway ko Telegram ke saath set up karo (meri tarjeeh), ya agar Telegram jahan mein rehta hoon wahan mushkil hai to Discord ya Signal par fall back karo. Plan samjhao aur batao shuru karne se pehle mujhe apni taraf se kya karna hai.

Aap ka agent gateway configure karta hai aur usay aik background service ke taur par install karta hai. Telegram ke liye woh aap ko bot token ke liye BotFather tak le jaayega. Phir woh aap ke chat ko home channel set karta hai: woh default jagah jahan cron jobs aur notifications baad mein land karenge.

Woh seam jo sirf aap cross kar sakte hain

Bot token platform se aata hai, agent se nahin. Telegram ke liye aap ka agent ruk kar aap se kahega ke aik bot @BotFather ke saath banayein aur token us mehfooz tareeqe se waapis paste karein jo woh bayan karta hai (aik environment value, chat nahin). Ho jaaye to agent ko "linked" kah dein.

Aap is scenario ke saath tab done hain jab: aap apne phone se apne bot ko aik message bhejte hain aur aik real jawab waapis aata hai, ussi agent ka generate kiya jis se aap ne TUI mein baat ki thi: wahi memory, alag surface.

Agar Telegram deliver na kare, to channels switch karein

Telegram set up karne ke liye sab se aasan channel hai, lekin deliver karne ke liye hamesha sab se reliable nahin. Kuch regions mein us ke servers throttled ya blocked hote hain, to setup theek dikhne ke baad bhi aik message send hone mein fail ho sakta hai (log mein api.telegram.org connection failed dikh sakta hai). Yeh aik delivery problem hai, setup ki ghalti nahin. Agar aap ke phone ko kabhi jawab na mile, apne agent ko batayein ke channel Signal ya Discord par switch kar de aur dobara koshish kare. Signal aur Discord sirf setup fallbacks nahin, delivery fallbacks bhi hain.

Woh waqai kahan chalna chahta hai (abhi parhein, baad mein karein)

Aik laptop sota hai; aik AI Employee ko nahin sona chahiye. Us ka asal ghar aap ka laptop hai hi nahin: yeh aik sasta always-on computer hai jisay aap apne phone se reach karte hain, aik aisa jis ka messages ke darmiyan takreeban kuch kharcha nahin. Official skill aur deep chapter aap ke agent ko wahan move karne mein le jaate hain jab aap loop ko locally saabit kar lein.


Scenario 3: Aik mushkil task dein aur dekhein woh apni skill khud likhta hai (~15 min)

Concept. Yeh woh scenario hai jis ka OpenClaw course mein koi equivalent nahin. Hermes aik closed learning loop chalata hai: aik substantial task ke baad, woh faisla karta hai ke abhi jo hua woh rakhne layak hai ya nahin, aik memory ke taur par, ya aik skill ke taur par jo agent apne liye likhta hai aur baad mein dobara use kar sakta hai. Jab tak aap usay aik real task se aik skill banwate na dekhein, "self-improving" marketing hai. Aik dafa dekh lene ke baad, aap isay har dafa pehchaanenge jab aap ka AI Employee kisi aisi cheez par tez ho jaaye jo aap aksar karte hain.

Pehle apni expectation set karein, taake aap wahan baith kar magic ka intezaar na karte rahein. Haan, Hermes apni skills khud likhta hai. Lekin aik likhni hai ya nahin yeh aik judgment call hai, aur aik pehle run par agent aksar isay bina kahe nahin karta. Isay hote dekhne ka bharosemand tareeqa loop ko steer karna hai bajaye log ko ghoorne ke: task karein, result aik dafa theek karein, aur usay batayein ke isay aise save kare jaise aap chahte hain ke yeh ho. Woh correct-once nudge woh move hai jis ki taraf aap sab se zyada jaayenge, aur neeche wala fallback note theek theek wording rakhta hai. To is scenario ko steering samjhein, aik khamoshi se auto-write ke saath aik acha bonus jab woh ho, woh cheez nahin jis ke liye aap baith kar intezaar karte hain.

Yeh apne agent ko paste karein:

Chalo woh hissa saabit karein jo Hermes ko alag banata hai. Mein usay aik real, thora fiddly task dena chahta hoon, woh jo mujhe agle hafte ussi tareeqe se dobara karna parta. Hermes log live tail karo taake mein dekh sakoon kya hota hai jawab ke baad, jab woh faisla karta hai ke skill save kare ya nahin. Phir mujhe batao jab tum mere task bhejne ke liye tayyar ho.

Aap ka agent aik live log view kholta hai. Ab aik aisa task bhejein jis mein shape ho: kuch repeatable, yaad rakhne layak steps ke saath, aap ke real kaam se. Acche pehle tasks:

  • "Aik messy changelog lo aur usay aik saaf weekly update mein badlo: theme ke hisab se group karo, shor giraao, users ke liye jo badla us se shuru karo."
  • "Is repo se open issues pull karo, unhein area ke hisab se cluster karo, aur top five ko is hisab se rank karo ke ignore karne par kitna nuqsaan dein."
  • "Is raw interview transcript ko aik tight one-page brief mein badlo: decisions, open questions, owners."

Log mein do phases dekhein. Pehle, ordinary agent loop chalta hai (message → model → tool calls → answer), wahi loop jo aap ne OpenClaw course mein dekha. Phir, woh hissa jo naya hai: agent kaam review karta hai aur, jab woh kaam ko rakhne layak samajhta hai, ~/.hermes/skills/ mein aik skill likhta hai (aik pehle task par woh faisla kar sakta hai ke na likhe, jo normal hai; neeche wala fallback note dikhata hai isay kaise nudge karna hai).

Confirm karne ke liye yeh paste karein:

Kya tum ne abhi us se aik skill save ki? ~/.hermes/skills/ mein jo hai woh list karo aur mujhe nayi dikhao: us ka naam aur woh chhoti description jo tay karti hai ke woh agli dafa kab fire hogi.

Aap is scenario ke saath tab done hain jab: aik aisi skill mojood ho jo Scenario 3 se pehle mojood nahin thi, aap ka agent aap ko us ki trigger description dikhaye, aur aap samajh lein ke woh description (install nahin) hi woh cheez hai jo usay baad mein dobara fire karwati hai.

Agar koi skill numoodaar nahin hui, to yeh aik tooti hui feature nahin

Skill banaani hai ya nahin yeh aik judgment call hai jo agent karta hai, to aik pehla task hamesha aik trigger nahin karta. Deterministic lever usay correct karna hai: task dobara chalayein, output aik dafa theek karein, aur usay batayein "isay aise save karo jaise mein chahta hoon yeh har dafa ho." Phir log dekhein aur aap usay khud SKILL.md likhte dekhenge. Woh correct-once move woh hai jis ki taraf aap sab se zyada jaayenge.

Aik pehla run aik alag tareeqe se bhi atak sakta hai: agent task slow tareeqe se karta hai (web searches ko shell commands ke taur par chalata hai, ya web pages par click karta hai) aur kuch likhne se pehle hi room se bahar ho jaata hai. Agar aap yeh dekhein, usay saaf batayein: "apni built-in web search use karo, task chhota rakho, aur skill likhna maqsad banao." Yeh usay waapis fast path par daal deta hai.

Jo skill woh likhta hai woh waqai kaisi dikhti hai

Aik skill bas markdown hai aik chhote YAML header ke saath. Agar aap ka task "aik messy changelog ko aik saaf weekly update mein badlo" tha, agent shayad ~/.hermes/skills/ mein (aik category folder ke neeche jo woh chunta hai, masalan writing/) kuch aisa likhe:

---
name: weekly-update-from-changelog
description: Turn a raw or messy changelog into a clean weekly update grouped by theme, leading with user-facing changes. Use when asked for a weekly update, release notes, or "what changed."
---

## When to Use

When asked for a weekly update, release notes, or a "what changed" summary from a raw changelog or commit log.

## Procedure

1. Group entries by theme (features, fixes, infra); drop noise (version bumps, lint).
2. Lead with what changed for users, in plain language.
3. Close with a one-line "worth flagging" if anything is risky or breaking.
4. Keep it under ~150 words unless asked for more detail.

## Verification

The summary leads with user-facing changes and a non-technical reader understands it.

Jo line matter karti hai woh description hai: yeh woh hai jo agent agli dafa parhta hai taake faisla kare ke yeh skill fire hoti hai ya nahin. Aik vague description aur skill kabhi trigger nahin hoti; aik sharp aur aap ka AI Employee theek issi task par tez ho jaata hai bina dobara batay kaise. Yahi poora loop hai. (Hermes skills open agentskills.io format follow karti hain: frontmatter plus sections jaise When to Use, Procedure, Pitfalls, aur Verification.)

Self-writing waqai kaise kaam karti hai

Jo skills agent apne liye likhta hai woh un ke saath land hoti hain jo aap install karte hain, to "us ne khud ko kya sikhaya" hamesha aik sawal ki doori par hai: bas usay list karne ke liye kahein. Woh yeh skills khud likhta hai, aksar aik mushkil task ke foran baad ya aap ke usay correct karne ke baad. NVIDIA ke apne NemoClaw walkthrough ka inhisaar theek issi mechanic par hai.

Closed learning loop: jawab ke baad, aik background review tay karta hai ke aik memory save kare ya aik reusable skill likhe: woh aadha jo OpenClaw ke paas nahin.


Scenario 4: Saabit karein woh aap ko aik fresh session ke across yaad rakhta hai, bina commit (~15 min)

Yahan OpenClaw ke saath sab se tez contrast hai. OpenClaw course mein aap ne aik deewar saabit ki: memory per-channel thi, aur aik haqeeqat usay paar le jaane ke liye aap ko usay jaan boojh kar aik MEMORY.md file mein commit karna parta tha. (Woh course skip kiya? Point bas yeh hai: OpenClaw sirf woh yaad rakhta tha jo aap ne usay save karne ko saaf kaha, to save miss karein, aur agla session blank shuru hota tha.) Hermes deewar aur kaam dono hata deta hai. Woh memory khud curate karta hai (apne aap ko nudge karta hai ke jo matter karta hai usay persist kare) aur sessions ke across recall karta hai apni history par full-text search plus aap kaun hain is ke aik model ke zariye.

Step 1: usay in-flight kuch sikhayein, phir chale jaayein. TUI mein (ya apne phone se), usay apne hafte ke baare mein aik real, temporary haqeeqat batayein:

Tumhare paas rakhne ke liye quick context: mein Thursday ke liye aik board update tayyar kar raha hoon, aur jo number mujhe fikar de raha hai woh churn hai. Abhi kuch karne ki zaroorat nahin.

Step 2: aik waqai fresh session shuru karein. TUI mein, /new bhejein (ya us se alag surface se message karein jo aap ne Step 1 mein use ki). Yeh aik clean slate hai: koi conversation carry over nahin.

Step 3: poochein, bina usay yaad dilaye.

Is hafte ke liye mujhe kis baat ki fikar thi, aur deadline kya hai?

Woh jawab deta hai, apni hi past-session recall se kheechta hai, aap ke usay dobara batay kisi cheez se nahin. Koi MEMORY.md commit nahin, koi /reset nahin. Us ne deewar khud paar ki.

Step 4: woh aap ka jo model bana raha hai usay dekhein. Apne general agent ko paste karein:

Mujhe dikhao Hermes ne ab tak mere baare mein kya likha hai: ~/.hermes/memories/ kholo aur USER.md aur MEMORY.md ko plain language mein summarize karo. Mein dekhna chahta hoon us ne kya infer kiya, sirf woh nahin jo mein ne usay bataya.

Aap is scenario ke saath tab done hain jab: Step 3 mein fresh session aap ki in-flight haqeeqat ko bina poochay recall kare, AUR aap ne apni aankhon se memories/ mein jo hai woh parh liya ho.

Cross-session recall: aik session aik searchable store aur memory mein khatam hota hai; aik fresh session ko woh store khud ba khud feed hota hai, to woh bina batay jaanta hai: OpenClaw ke manual commit ka ulta.

Contrast, saaf bayan kiya gaya

OpenClaw: aap commit karte hain, to memory auditable hai kyunke aap ne likhi. Hermes: woh commit karta hai, to memory bina mehnat ke compound hoti hai, jo theek wahi wajah hai jis se aap ko memories/ waqtan fawaqtan parhna parta hai. Convenience ne kaam "save karna yaad rakhne" se "us ne kya save kiya woh check karne" ki taraf move kar diya. Monthly audit (scenarios ke theek baad) wahan hai jahan woh check rehta hai.


Scenario 5: Skill dobara use karein, model swap karein, no lock-in saabit karein (~15 min)

Aik scenario mein do proofs, dono ussi aik idea ke baare mein: Hermes mein, model woh hissa hai jo replaceable hai. Durable asset woh skill-and-memory layer hai jo aap bana rahe hain, aur usay parwah nahin ke aap kaun sa brain plug karte hain.

5a. Scenario 3 wali skill dobara use aur behtar karein

Aik aisa task bhejein jo aap ke Scenario 3 task se milta-julta ho lekin aik jaisa na ho (aik alag changelog, aik alag repo, aik alag transcript). Log dekhein: is dafa agent woh skill load karta hai jo us ne pehle likhi bajaye zero se kaam karne ke, aur jab woh baad mein kaam review karta hai woh ussi skill ko update karne ki taraf jhukta hai, usay us se sharp karte hue jo abhi seekha.

Confirm karne ke liye paste karein:

Skill ko ab us se compare karo jo woh Scenario 3 ke baad thi. Kya yeh update ya version-bump hui? Mujhe dikhao kya badla.

5a done jab: skill naye task par fire ho AUR aap ka agent dikhaye ke woh refine hui, sirf dobara nahin chali.

Aik real self-improvement kaisi dikhti hai

Aik live run par, skill is scenario aur model swap ke darmiyan v0.1.0 se v1.0.0 par jump kar gayi, aur change cosmetic nahin thi. Us ne aik clumsy method (aik raw curl call ko haath se chalana) chhor kar aik saaf method (apni built-in web search) le liya, aur us ne do sections add kiye jin ki usay zaroorat ka pata chala tha: aik "Common Pitfalls" list aur aik "Verification Checklist." Yeh loop theek wahi karta hua jo woh advertise karta hai: worker ko aik behtar tareeqa mila aur us ne apni hi instructions dobara likh dein. Jab aap upar wala compare prompt chalayein, yahi woh tarah ka diff hai jisay dhoondna chahiye.

5b. Brain switch karein, baqi sab rakhein

Yeh apne agent ko paste karein:

Ab saabit karo koi lock-in nahin. Isay aik alag model par switch karo, behtar ho aik sasta, taake mein check kar sakoon koi aur cheez nahin tooti. Phir aik aisa task dobara chalao jo 5a wali skill use kare taake mein wahi skill aur wahi memory aik alag model ke neeche kaam karte dekh sakoon.

Agent model switch karta hai (no code, skills ya memory ka koi re-config nahin) aur dobara chalata hai. Wahi skill. Aap ki wahi memory. Neeche aik alag model.

Aap Scenario 5 ke saath tab done hain jab: aik task aik doosre model par sahi mukammal ho, us skill aur memory ka use karte hue jo aap ne pehle model ke neeche banayi, aur aap ne dekh liya ho ke switch karna aik command tha, aik migration nahin.

"No lock-in" ka matlab swap aasan hai, yeh nahin ke har model barabar hai

Models switch karna aik command hai, aur yahi woh lock-in point hai, jo real hai. Jo woh waada nahin karta woh aik jaisi quality hai. Aik verbose skill ko aik saste, leaner model (kahein gpt-4o) par daalein aur output namayan tor par bura, yahan tak ke garbled bhi aa sakta hai, kyunke skill aik mazboot brain ko zehan mein rakh kar likhi gayi thi. Fix saste model ko chhorna nahin; yeh skill ko tab tak tight karna hai jab tak aik chhota model usay saaf follow kar sake. No lock-in dono taraf kaam karta hai: aap paisa bachane ke liye neeche jaane ke liye azaad hain, aur instructions ko tab tak sharp karne ke liye azaad hain jab tak saara model tik jaaye.

Lock-in inversion: aap ki memory, skills, aur identity apni jagah rehti hain jab ke neeche model aik command mein swap hota hai: model woh hissa hai jo rented hai.


Scenario 6: Usay khud act karwayein, phir brain back up karein (~15 min)

6a. Aik scheduled job, plain language mein

Yeh apne agent ko paste karein:

Aik scheduled job natural language mein set up karo aur usay mere phone par deliver karo: har weekday ko subah 8 baje, aik chhota morning digest jo us se bana ho jo tum mere baare mein pehle se jaante ho, mere notes, aur jo hum ne haal mein kiya, aaj meri tawajju ke layak top do ya teen cheezon ke saath. Save karne se pehle mujhe schedule dikhao, aur usay abhi aik dafa chalao taake mein usay Telegram par land karte dekh sakoon bina kal ka intezaar kiye.

Aap ka agent job banata hai (aik natural-language schedule, delivery aap ke home channel par), aap ko schedule dikhata hai, aur aik test run trigger karta hai taake digest abhi aap ke phone par aaye. Yeh pehla job jaan boojh kar aisa hai jisay kisi bahar ke tools ki zaroorat nahin: yeh digest aap ki memory files aur haaliya notes se banata hai, jo aik scheduled run ke paas hamesha hath mein hote hain. Agar message kabhi land na kare, to job ke bajaye delivery weak link ho sakti hai. Kuch regions mein Telegram throttled ya blocked hota hai (aap log mein api.telegram.org connection failed dekh sakte hain), aur fix yeh hai ke apne agent se home channel Signal ya Discord par switch karwa kar dobara chalwayein.

Scheduled jobs aap ke live chat se leaner chalti hain

Aik scheduled run wahi setting nahin jo woh chat jo aap use kar rahe the. Yeh aik fresh session mein aik chhote toolset ke saath jaagti hai: by default us ke paas koi web search aur koi messaging tools nahin (scheduler khud delivery sambhalta hai, to yeh phir bhi aap ke phone tak pohanchta hai). Yahi theek wajah hai ke upar wala pehla job aap ke apne notes aur memory se banta hai, jo hamesha hath mein hote hain. Agar aap aik aisa scheduled job chahte hain jo web se taaza maaloomat pull kare, to yeh aik extra step hai: apne agent ko batayein woh us specific job ke liye web tool enable kare (woh job par enabled_toolsets=["web"] set karta hai, ya cron platform ke liye web on karta hai). Phir aap ka 8am brief sirf aap ke notes ke bajaye duniya research kar sakta hai. In mein se kuch bhi aik bug nahin; scheduled agents bas aap ke interactive session se kam ke saath chalte hain, jaan boojh kar.

6a done jab: aik scheduled job mojood ho AUR aik test run fire ho aur digest message aap ke phone par bina nigaraani land kare.

6b. Us worker ko back up karein jisay aap train kar rahe the

Ab tak Hermes ke paas kuch hai jo protect karne layak hai: aik skill jo us ne likhi, aap ka aik model, aik scheduled routine, jin mein se koi aik ghanta pehle mojood nahin tha. ~/.hermes/ ko us asset ki tarah samjhein jo woh hai.

Yeh apne agent ko paste karein:

Sab kuch back up karo taake mein woh na khoon jo us ne seekha hai, aur mujhe dikhao mein aik nayi machine par isay kaise restore karoon. Confirm karo ke backup ne config, skills, memories, aur sessions capture kar liye, batao woh kahan hai, aur restore step kahin save karo jahan mein baad mein dhoond loon.

Aap ka agent configuration, skills, memories, aur session store mehfooz tareeqe se back up karta hai chahe Hermes chal raha ho, aur codebase ko khud chhor deta hai.

Aik upgrade jo maangne layak hai: workspace ko aik one-off zip ke bajaye aik private Git repo mein back up karein. Phir us ki skills ko aik poori history milti hai, aur aap har woh skill dekh sakte hain jo agent ne likhi ya dobara likhi, aik timestamp ke saath. Woh history sab se sasta tareeqa hai yeh dekhne ka ke agent ka behavior waqt ke saath kaise badalta hai, aur agar us ne ghalat sabaq seekha to aik change roll back karne ka. Apne agent ko batayein woh private repo set up kare, secrets aur session caches exclude kare, aur har ahem change par commit kare.

Aap Scenario 6 (aur crash course) ke saath tab done hain jab: aik job khud se aap ke phone par chale, aik backup zip mojood ho, aur aap ke paas aik hermes import one-liner saved ho. Aap ka AI Employee ab aap ke sotay waqt kaam karta hai, aur aik dead laptop se bach jaata hai.


Scenario 7: Usay aik voice dein (bonus, ~10 min)

Maqsad: aap apne Telegram bot ko message karte hain aur woh bole gaye audio se jawab deta hai, free, aur bina koi nayi key set up kiye.

Yeh aik treat hai, requirement nahin. Jo kuch aap ne banaya woh is ke baghair bhi chalta hai. Lekin yeh takreeban das minute leta hai, aur yeh aap ke AI Employee ko aisi cheez bana deta hai jisay aap ghar wapas chalte hue sirf parhne ke bajaye sun sakte hain.

Yeh apne agent ko paste karein:

Mere Hermes ko aik voice do: isay aise set up karo ke jab mein apne Telegram bot ko message karoon to woh audio se jawab de jisay mein sun sakoon, free option use karte hue taake mein koi nayi key na add karoon.

Aap ka agent voice extra package aur ffmpeg (woh chhota audio tool jo speech file assemble karta hai) install karta hai, text-to-speech ko free Edge default par set karta hai, aur aap ke Telegram bot ke liye Auto Voice Reply on karta hai. Yeh sab non-interactive aur agent-run hai, to aap ke drive karne ke liye koi wizard nahin aur enter karne ke liye koi card nahin. Edge box se hi free hai; agar aap voice ke liye Gemini use karna chahein, woh bhi free hai, ussi key ke zariye jo aap pehle hi Scenario 1 mein rakh chuke hain.

Aap is scenario ke saath tab done hain jab: aap apne phone se apne bot ko aik message bhejte hain aur aik bola gaya jawab waapis aata hai jisay aap play kar ke sun sakte hain.

Aik aur option agar aap zor se baat karna pasand karte hain: aik microphone loop terminal se chalta hai, aap ke local mic par sunta hai, aap ki apni machine par transcribe karta hai, aur ussi free Edge voice se jawab deta hai. Apne agent se kahein woh aik hands-free chat ke liye CLI voice loop set up kare.


Aap ne kya banaya

Navvey minute mein aap nothing se aik aise AI Employee tak pohanche jo apni skills khud likhta hai (Scenario 3), bina poochay sessions ke across aap ko yaad rakhta hai (Scenario 4), kisi bhi model par chalta hai jo aap chunein (Scenario 5), aur bina nigaraani kaam karta hai (Scenario 6). Yahan se theek aik ongoing habit hai jo aap ke calendar ke layak hai (monthly audit, aage), aur phir aik map ke yeh kahan ja sakta hai.


Mahine mein aik dafa, aaj nahin: skills & memory audit (~10 min)

Aik self-improving agent ko ground truth dene ke liye aik insaan ki zaroorat hai. Akela chhor diya jaaye, to Hermes ghalat cheez par tez aur zyada confident ho sakta hai. Monthly habit woh hai jis se aap loop ko honest rakhte hain.

Jab waqt aaye to yeh apne agent ko paste karein:

Monthly check chalao: mujhe dikhao tum ne khud ko kya sikhaya bemuqabla jo mein ne install kiya aur kisi bhi stale ya risky ko delete karne ke liye flag karo, installed skills ko security issues ke liye dobara scan karo, aur jo tum ne mere baare mein record kiya us ka summary do taake mein kuch ghalat theek kar sakoon.

Teen cheezein jo waqai check karni hain: skills (hermes skills list dikhata hai agent ne apne liye kya likha bemuqabla jo aap ne install kiya, to jo bhi anjaana ho usay parhein; jo stale ho usay delete karein; agent apni cheezein ~/.hermes/skills/ mein category folders ke neeche file karta hai), memory (parhein us ne aap ke baare mein MEMORY.md / USER.md mein kya infer kiya aur jo off ho usay theek karein), aur supply chain (hermes skills audit installed hub skills ko security issues ke liye dobara scan karta hai, plus aik sakht usool ke aisi koi community skill kabhi na rakhein jo aap ne parhi nahin). Agar aap ne Scenario 6 wala Git backup set up kiya, yeh bhi woh waqt hai jab aap apne agent se woh history dikhwate hain, theek theek jo us ne pichle mahine se khud ko sikhaya. Aur agar aap chahein ke agent skills khamoshi se likhe hi na, Blank Slate setup mode skill-writing aur memory capture ko tab tak off rakhta hai jab tak aap opt in na karein.

Imandar ceiling

Theek se bayan karein ke yahan "self-improving" ka matlab kya hai, kyunke yeh jumla aasani se over-sell karta hai. Hermes apni memory aur skills curate kar ke behtar hota hai — model ko retrain kar ke, apna source dobara likh kar, ya runtime par apne prompt templates edit kar ke nahin. Training kabhi self-triggered nahin; neeche wala model wahi hai jo aap ne pick kiya. Jo badalta hai woh notebook hai, brain nahin. Yeh imandar version hai, aur yeh phir bhi waqai powerful hai: worker hafton mein aap ke kaam par tez hota jaata hai.

Asal risk runaway autonomy nahin; yeh khamosh drift hai — pakarna sab se mushkil theek un domains mein jahan aap aasani se agent ka kaam check nahin kar sakte. Aap woh haqooq rakhte hain jo isay manageable banate hain — MIT license, aap ki machine par aap ka data, skills jo aap plain markdown mein parh sakte hain aur hermes skills audit se dobara scan kar sakte hain, aik Blank Slate mode jo self-writing ko tab tak off rakhta hai jab tak aap opt in na karein, aur aik Git history. Nous aap ko woh haqooq deta hai; woh aap se unhein istemaal nahin karwa sakta. Audit aap ka unhein istemaal karna hai. Aik agent jo aap ka kaam seekhta hai woh sab se zyada qeemti worker hai jo aap banayenge aur sab se zyada check karne layak. Yeh Hermes par tanqeed nahin; yeh kisi bhi aisi cheez ki structural haqeeqat hai jo khud ko behtar karti hai.


Core scenarios se aage

Chhe core scenarios aap ko aik working, self-improving AI Employee de dete hain. Yeh section us ka map hai jo baad mein aata hai: chaar directions jin par ecosystem converge kar chuka hai, har aik aik detour ke bajaye aik natural agla step.

Isay apne real tools se connect karein

Aik self-improving agent jo aap ki duniya ko chhoo nahin sakta woh aik bohot smart notebook hai. Unlock connectors hai. Do routes:

  • MCP servers: open standard. Aap ka agent config.yaml mein aik server block jorta hai (GitHub, aik database, aik calendar) aur Hermes ko woh tools mil jaate hain. Behtar jab us cheez ke liye aik saaf MCP server pehle se mojood ho jo aap chahte hain.
  • Composio jaisa aik aggregator: aik connection jo Gmail, Google Calendar, Slack, Notion, aur saikron mein fan out karta hai, aik generous free tier ke saath. Aap aik dashboard mein per account aik dafa authorize karte hain; agent unhein aik single integration ke zariye call karta hai. Behtar jab aap breadth tez chahte hain bina har service ko khud wire kiye.

Jo usool isay mehfooz rakhta hai woh guardrails note wala hai: sab se kam connect karein jo aap ko chahiye, kisi bhi outbound cheez ke liye draft-over-send ko tarjeeh dein, aur "MCP candy store" ko resist karein. Har extra connector har prompt mein tool definitions jorta hai, to aik bloated toolbelt agent ko slower aur zyada confused banata hai, zyada capable nahin. Aik tool tab jorein jab koi real task usay chahta ho, pehle se nahin.

Aik seerhi, aik chhalaang nahin

Yeh jaanna madad karta hai ke aap kahan hain aur aage kya hai. Aik rough progression jis par community settle ho chuki hai:

  1. Download & go: one-shot tasks; aap ne yeh Scenario 1 mein kiya.
  2. Woh aap ko jaanta hai: memory aur aik SOUL/USER profile; Scenario 4.
  3. Commands & model-agnostic: model, personality, ya background behavior switch karne ke liye quick built-in commands, plus har job ke liye sahi model; Scenario 5.
  4. Integrator: email, calendar, Slack, aur MCP connectors wired in (upar).
  5. Orchestration: Hermes isolated sub-agents spawn karta hai jo parallel kaam karte aur report back karte hain, aik sasta model grunt work par aur aik mehnga supervise karte hue.
  6. Builder: woh real software ship karta hai aur scheduled, asynchronous kaam chalata hai jab aap door hote hain; Scenario 6 pehla rung hai.
  7. Aik operating system: Hermes, aap ke coding agents, aur aap ke notes memory share karte hain, to aik surface par kiya kaam doosron ko dikhta hai.

Aap is seerhi par naya theory seekh kar nahin charhte; aap aik aur tool connect kar ke ya aik aur task delegate kar ke charhte hain. Seven-level crowd ka imandar caveat rakhne layak hai: us cheez ko automate karein jo waqai aik bottleneck hai, us cheez ko nahin jo automate karne mein maza aata hai.

Open harnesses ke darmiyan Hermes kahan baithta hai

2026 tak open-source agent world teen layers mein bant chuka tha jo complementary hain, competing nahin:

  • OpenClaw: gateway. Breadth: har messaging channel par aik agent, sab se barri community skill marketplace. "Employee."
  • Hermes: learner. Depth: built-in learning loop, persistent memory, model-agnostic. "Aisa employee jis ki notebook kabhi khaali nahin hoti."
  • Paperclip: orchestrator. Woh agents ki teams ko aik company ki tarah chalata hai: org charts, per-agent budget caps, atomic tasks, aik append-only audit trail. "Agar OpenClaw employee hai, to Paperclip company hai."

Teen open harnesses, teen jobs: OpenClaw gateway (employee), Hermes learner (notebook jo kabhi khaali nahin hoti), aur Paperclip orchestrator (company), Hermes ke saath jo aik official adapter ke zariye Paperclip ke andar chal sakta hai.

Zyada tar serious setups in ko combine karte hue khatam hote hain. Hermes aik official adapter ship karta hai taake aik Hermes agent aik Paperclip company ke andar aik managed employee ke taur par chal sake, jo theek woh bridge hai Workforce with Paperclip crash course mein. Shape of problem ke hisab se pick karein: aik gehra personal agent → Hermes; har channel par reach → OpenClaw; budgets aur governance ke saath aik coordinated team → upar Paperclip.

Isay infrastructure ki tarah chalana

Upar wali har cheez aap ke laptop ya aik sastay always-on computer par theek chalti hai. Aik dafa aik agent waqai sensitive data sambhalta hai, to aap usay aik governed wrapper ke peechhe rakhte hain: kuch aisa jo us ke liye keys rakhta hai aur limit karta hai woh kya reach kar sakta hai, taake agent kabhi raw tokens na dekhe aur kisi aisi destination par bhatak na jaaye jo aap ne allow nahin ki. NVIDIA NemoClaw is shape ka sab se saaf public example hai. Shuru karne ke liye aap ko is mein se kuch nahin chahiye. Yeh bas woh hai jo "self-hosted, you own it" ban jaata hai jab agent real data par real kaam karne lagta hai: wahi AI Employee jo aap ne navvey minute mein set up kiya, ab aik seatbelt pehne hue.


Ab aap ke paas tasveer ke dono halve hain: OpenClaw aur Hermes. Agla faisla (kaun sa aik kisi job ko chahiye, ya woh dono chahta hai) aik aisa hai jo aap aakhir kar aik feature table ke bajaye tajurbe se kar sakte hain. Deep chapter upar wale har thread par aage jaata hai; yeh navvey-minute version tha.


Flashcards Study Aid

Knowledge Check

Un ideas par aik quick gated self-check jin se aap abhi guzre.

Checking access...