Profile-Driven Co-Pilot
Onboards once, remembers your stack, standards, and decisions. Soul + per-project memory keep context across every session — no more repeating yourself.
agenIT remembers your codebase, walks the work from idea to a tested PR, and proves it on real hardware — running on the LLM you already pay for. Your source code never leaves your laptop.
$ npm install -g @agenit/cli · works with your existing LLM auth · free during early access
Your standards, decisions, and naming — recalled on every session.
Idea to a tested PR, walked stage by stage with approval gates you control.
J-Link · CAN/LIN · Saleae — flashed, captured, and decoded in the loop.
Local-first by default. Your LLM, your machine, your contract.
agenIT speaks to the same probes on your desk. It flashes the board, streams RTT logs, decodes the CAN bus, captures the logic analyzer — then writes the fix, the test, and the traceability that proves it works on real hardware, not just in a chat window.
Of bus + log events on the bench are captured, decoded, and linked to a test — automatically.
Reproduce on hardware, fix MISRA-clean, add the HIL regression, and ship the evidence pack — in one uninterrupted run.
Illustrative · pilot teams in automotive embedded.
You already use an AI. agenIT is the layer around it that makes it useful for real software — the kind that ships, gets tested, and ends up in production.
Not a chat. It writes down your style, your decisions, and the names you use, and reads them back into every session. No more re-explaining.
Walks Plan → Code → Tests → Audit. Stops at approval gates you can move through one click, or auto-approve when the change is small.
Use the one you pay for: Claude, Gemini, GPT-4, or a local Ollama box. Same workflow, same skills — just point at your endpoint.
Your source never leaves your laptop. Only the AI's instructions stream in, get used, and get wiped when you're done.
Cheap helper models read the codebase in parallel; one capable model writes the answer with that context already in hand. Faster and cheaper.
Web app, mobile, CLI, embedded firmware — switch profile, same product. Hardware folks get J-Link / CAN / Saleae built in.
| Capability | Copilot | Cursor | Claude Code | agenIT |
|---|---|---|---|---|
| Memory across sessions | — | limited | session | plaintext markdown · git-diffable |
| Bring your own LLM | OpenAI | few | Anthropic | 6 backends incl. Ollama |
| Code stays on your machine | no | no | yes | yes · plus IP wipe on exit |
| Finishes the job (Plan→Test→Audit) | — | — | manual | /run walks it |
| Approval gates | — | — | — | configurable per stage |
| Hardware (J-Link / CAN / Saleae) | — | — | — | first-class |
| Open & inspectable | no | no | partial | open runner · signed assets |
Most engineering teams already have an LLM. What they don't have is the discipline, memory, and integrations to make it pay back. Here's what agenIT removes from the week.
Illustrative · based on pilot teams in automotive embedded and regulated SaaS.
Same incident, two worlds. The example below is from an embedded team — but the shape is identical for SaaS: a stale on-call, a ticket nobody can root-cause without waking three more people, a fix that lands without tests. agenIT wakes the squad instead of the team.
Most AI dev tools want your codebase on their servers. Fine for a side project — not fine if you work on anything proprietary. agenIT flips it: the tool runs on your machine, the AI's instructions come to you, and they're cleared from disk when you're done. The wire only carries your licence going out and the AI's skills coming in. Your source code? Never moves.
The wire only carries a quick licence check going out, and the AI's playbooks coming in. Your code, your prompts, and your memory stay on your laptop.
The runner is open and auditable. You can lock it to a specific release and verify the signature.
AI playbooks arrive when a session needs them and never sit on your disk between runs.
Anything we deliver is removed from your machine the moment you close the session.
Point at a local model (Ollama) and a local cache and run fully offline. Nothing leaves your network.
Three lenses on the same shift: time-to-resolve, what ships with each release, and how much throughput a squad gains in a sprint. These are directional averages from early-access teams — not a guarantee, not a marketing number.
Directional · averaged across early-access teams in automotive embedded and regulated SaaS. Detailed pilot report available under NDA.
Eight pillars that turn an LLM into a disciplined engineering teammate — not a chat window.
Onboards once, remembers your stack, standards, and decisions. Soul + per-project memory keep context across every session — no more repeating yourself.
Five stages, six commands (/swe1–/swe6), one /run that walks them. Each stage writes a real file (requirements.md, decisions.md, traceability.json) and stops at a configurable approval gate.
Six backends: Gemini CLI (default, free OAuth), Claude CLI, Antigravity (agy), Anthropic SDK, OpenAI, and Ollama for air-gapped. agenIT picks per config — no vendor lock-in.
Parallel helpers in phases — research (codebase-mapper, pattern-finder, dependency-grapher) → critique (security-reviewer, test-gap-scanner, complexity-grader) → synthesis. Workers on cheap models, planner on capable.
First-class JLink, CAN/LIN, and Saleae Logic 2 integration. Arduino-as-HIL turns dev boards into test nodes for real embedded validation.
Unified marketplace for MCP servers, Claude skills, and Gemini extensions. Compose your toolchain without forking the CLI.
Nine-phase SpecKit pipeline takes you from constitution to issues, with PMO standards enforcement and budget tracking built in.
/overnight runs a git-backed Plan → Code → Test → Evaluate loop with worktree parallelism and per-iteration rollback safety.
Most AI tools stop after "here's some code". agenIT keeps going — it writes down what you wanted, decides how, codes it, tests it, and checks that everything ties back. Each step writes a real file in your repo and pauses for a quick Approve.
Under the hood it's the ASPICE V-Model — the same workflow safety-critical teams use. You don't have to care about that to use it.
Dashed lines pair each design artifact with the verification artifact that proves it. agenIT enforces the pairing — /swe6 regenerates traceability.json and fails if any SWR-NNN isn't linked to a file and a test. Type /run to walk the whole chain end-to-end with approval gates.
ALM (Codebeamer, DOORS, Polarion, Jama), PM (Jira, ADO), CI (Jenkins, GitLab), debuggers (J-Link, Lauterbach, OpenOCD), bus & logic (Saleae, PEAK, Vector), IDEs, and chat — all reachable from one REPL via MCP servers and skills.
Already pay for Claude? Use it. Have a Google Workspace account? Free tier works. Air-gapped lab with an Ollama box? That works too. Flip one line in agenit.toml — the V-Model, the squad, and every skill keep working.
gemini-clidefaultFree tier plus Pro/Ultra Google accounts. Spawns gemini -p ... as a subprocess. No API key needed.
claude-clisubscriptionReuses your Claude.ai Pro / Max / Team / Enterprise login. Trusts the CLI's built-in tools.
antigravity-clinext-genagy · OAuth or APIGoogle's successor to gemini-cli after 2026-06-18. Skills folder layout: .agents/skills/*.md.
anthropic-sdkAPI keyDirect API path for headless CI and machines without the claude binary. Set ANTHROPIC_API_KEY.
openaiAPI keyCross-provider parity. Useful for benchmarking the same prompt across models.
ollamaair-gappedZero outbound traffic. Pair with the local orchestration cache for a fully offline install — no licence check on each run.
[backend]
provider = "claude-cli" # gemini-cli · claude-cli · antigravity-cli
# anthropic-sdk · openai · ollama
worker_model = "haiku" # squad helpers — cheap tier
planner_model = "sonnet" # primary agent — capable tier
[squad]
auto_squad = true
max_concurrent_agents = 4A stateful TypeScript CLI drives your chosen LLM, which delegates to a Python bridge for hardware and heavy parsers.
A mock of the agenIT REPL so you can feel how it works before installing. Type a slash command or anything in free form — outputs are simulated, but the flow is the real one.
A REPL designed like an OS shell — every workflow, integration, and persona is one command away.
agenIT isn't just a code co-pilot. It ships with a Project Management Office layer that plans sprints, tracks budgets, enforces standards on every activation, and reports back without anyone opening a spreadsheet.
/sprint plan turns a goal into stories, slices them into squad-sized tasks, and assigns each to the right agent — with story points and dependencies.
Every goal carries a turn budget and an LLM-token budget. /goal status shows burn-down per stage; agenIT stops at the gate when budget is depleted.
PMO templates (naming, commit style, doc layout, branch policy, license header) are loaded with the profile and enforced via pre-tool policy hooks.
/sprint review compiles velocity, burn-down, requirement coverage, MISRA findings, and per-agent metrics into a single markdown report.
/risk runs an impact analysis against the traceability graph and flags downstream tests and requirements affected by the change.
/report rolls up tasks, approvals, decisions, and budget across projects — exportable to Confluence, Notion, or your SIEM.
Five flavours of extension — Skills (what the AI can do), Plugins (where it lives — VS Code, JetBrains, Slack, CI), Tools (MCP servers for your toolchain), Profiles (full preset bundles), and Recipes (one-shot prompt chains). Install with a slash command. Publish your own.
misra-c-2012 · owasp-asvs-audit · tdd-loop
vscode-repl · slack-jobs · github-actions
codebeamer-mcp · jlink-bridge · jira-issues
automotive-ecu · saas-typescript · medtech
migrate-doors-to-ears · rewrite-as-tdd
You already pay for an LLM. The added value isn't more tokens — it's orchestration, memory, governance, and integration that turn the model into an accountable engineer inside your existing process.
Plug in Gemini, Claude, GPT-4.1, Azure OpenAI, AWS Bedrock, on-prem vLLM or llama.cpp. Same skills, same REPL, your endpoint.
Memory, code, and prompts stay in your VPC or on the developer's machine. No telemetry, no training-on-your-data clauses.
Sign and pin skills, MCP servers, and extensions. PMO templates enforce coding standards and budget at activation time.
Every approval, tool call, and file write is logged. Export to your SIEM. Reproduce any artifact from the memory snapshot.
Prompts go from your machine directly to the LLM endpoint you configured. agenIT never proxies your code or stores your prompts.
Built for teams that can't afford to leak code or trust opaque vendors. Every layer is local-first, inspectable, and consent-gated.
Persona, project memory, and code search index live on disk under your repo or $HOME — never uploaded by agenIT.
We orchestrate; we don't proxy. Your prompts go directly from your machine to the LLM endpoint you configured.
Zero analytics by default. If you opt in to anonymous usage stats, the schema is documented and inspectable in the portal.
Lock agenIT to a build, sign skills, pin MCP versions. Nothing updates without your explicit approval.
Run fully offline with a local model (Ollama / vLLM) and local MCP servers — JLink, Saleae, filesystem, git.
Every shell command, file write, and external call passes through a hook chain you control.
AUTOSAR / CAN / LIN drivers with ASPICE-compliant traceability from CRS to test report. Bootloader, UDS, OTA — all profiled.
MISRA / CERT-C aware code, HIL validation loops, audit-ready evidence packets for IEC 62304, DO-178C, ISO 26262.
Import legacy DOORS / Codebeamer / Polarion baselines into EARS, generate gap analysis, push back enriched items.
Overnight /goal Plan → Code → Test → Evaluate cycles with deterministic rollback and per-iteration approval.
Connect a J-Link or Saleae and let the squad reproduce flakey signals, bisect commits, and write the regression test.
Auto-generate traceability matrices, MISRA reports, OWASP audits, and a release dossier reviewers actually accept.
Same V-Model discipline, swapped profile: user stories, Playwright tests, OWASP / a11y audits, Lighthouse budgets.
Ship internal skills, MCP servers, and PMO templates so every team inherits standards without retraining.
We tried Cursor and Claude Code first. What we kept was agenIT — because we couldn't ship the actual codebase out, and because every other tool forgot what we agreed on yesterday.
Quote attributable on request · pilot under NDA.
Free during early access. Pay only when you'd notice we stopped working. No card needed today.
Final GA pricing may change · all plans local-first by default · LLM token costs are paid to your model provider, not to agenIT
The CLI is one npm package. Pick any of the six LLM backends and bring your own auth — Google OAuth, Claude subscription, API key, or a local Ollama box.
# 1. Install the CLI $ npm install -g @agenit/cli # 2. Activate with the JWT you got by email $ agenit activate ~/Downloads/licence.jwt # 3. Pick a backend (Gemini CLI is the default) $ npm install -g @google/gemini-cli && gemini auth login # 4. Walk the V-Model on your project $ cd my-project && agenit [my-project] flow> /run requirements/CRS.pdf → /swe1 requirements.md (12 SWR-NNN extracted) → /swe2 decisions.md (ADRs drafted) → /swe3 decisions.md (component design refined) → /swe4 source + @req tags (MISRA-clean) → /swe5 tests/ (unit + integration + BDD) → /swe6 traceability.json (✓ 100% coverage)
Free during early access. No credit card. Use the LLM you already pay for. We'll email your licence within one business day.