AdaptiveMind · Studio · Singh
Factory // Online·Synced 4s ago

Studio of Singh · India

[ 00 ] · Manifesto

Solo builder · First pilots open

I build machines
that build machines.

Orchestrated software delivery — multi-LLM pipeline, human-governed. 5 production systems live, 1 enterprise build under concept review. No paying customers yet. First 2-3 pilots · By application.

I design and run an orchestrated software delivery workflow built on reliability, verification, and repeatable shipping. Multiple LLMs run as independent roles; I review and ship. This site documents the live systems, production outcomes, and operating model behind the Studio.

Engagement

New build · Productionize · Operate

Slots

First 2–3 pilots · By application

First reply

Within 48 hours · From a human

FRAME / 001REC ●F /1.4
Field Sprint · 001·REC ●

Two production sites. Ten minutes. Nine of nine audit checks green.

Singh assigns · Claude orchestrates · Vercel ships · Audit verifies

Read the receipts

[ 01 ] · The Moat · Autonomous Factory

Autonomous Factory.
Governed delivery at scale.

The Autonomous Factory is a multi-LLM engineering pipeline that turns a spec into a verified artefact. It is contract-first: every run starts from a measurable blueprint (capabilities, contracts, OUT-OF-SCOPE, success criteria).

Multiple models run as independent roles - we try to keep authorship and verification separate. Gates default to fail-closed: if a contract breaks, tests fail, or drift is detected, the run stops. Fixes + failure modes are tagged as learnings and carried into the next run so the pipeline can improve over time.

Roles (Multi-LLM)

  • ORCH Orchestrator runs the loop
  • ARCH Architecture reviewer audits design
  • QC Quality gatekeeper scores & blocks drift
  • TWIN Digital twin reviews like the owner
  • VERIFY Final verifier checks blueprint ↔ artefact
Contract-firstTDDVerified-onlyLearnings reused
  1. 01

    Blueprint

    Contract-first planning

    Each build starts with a measurable spec: capabilities, contracts, an OUT-OF-SCOPE list, and success criteria with numbers. We try not to write code until the blueprint is agreed.

  2. 02

    Multi-LLM Review

    Architecture audit

    Before anything ships, the blueprint goes through an architecture review by an independent model. We iterate until the design clears a quality threshold.

  3. 03

    TDD Build

    Test-first execution

    Agents author code against tests written first. We aim to put assertions at each endpoint, contract boundary, and state transition before implementation arrives.

  4. 04

    Gate Swarm

    Quality gates in parallel

    Security, contract fidelity, dead-code, dependency budget, and PMO checks run in parallel. Gates default to fail-closed when a check is uncertain.

  5. 05

    Audit

    Independent verification

    A second model audits the build against the blueprint (not against the code). Drift between intent and artefact becomes a learning before we ship.

  6. 06

    Ship

    Captured + reused

    Failure modes from the run are tagged, written into the Project Learning Ledger, and carried into the next build prompt. The goal is steady improvement across projects.

[ 02 ] · Featured Work

Five things live in the world.
One under concept review.

Five live sites you can open right now. One enterprise system (WRIE) that is sim-validated and awaiting first live deployment. Each entry is honest about what shipped, what didn’t, and what stage it’s at.

UNIFIED RISK INTELLIGENCE PLATFORM60+ INTEGRATIONSSOURCES · 60+TenableQualysRapid7WizClarotyNozomiDragosArmisSplunkSentinelElasticSumoCrowdStrikeSentinelOneDefenderCortexOktaPingAuth0EntraPaloAltoFortinetCiscoZscalerURIPNORMALISATIONSCORINGENRICHMENTCISO DASHBOARDSAUTO-REMEDIATIONTICKETINGURIP · CVSS · EPSS · KEV · MITRE ATT&CK
[ 01 / 06 ]
LIVECYBERSECURITY · SAAS

URIP

Unified Risk Intelligence Platform

An enterprise cybersecurity SaaS that pulls vulnerability, threat, identity, and cloud-security data from 60+ tools (Tenable, Qualys, CrowdStrike, Splunk, Wiz, Okta and more) into a single platform. URIP normalises signals, runs a composite CVSS+EPSS+KEV score per risk, maps findings to MITRE ATT&CK groups, monitors the dark web for related leaks, and pushes prioritised actions out to dashboards, auto-remediation scripts, and Jira tickets. The goal: turn fragmented security telemetry into one ranked queue of work.

Integrations

61

Ari 0°-30°Mar 21-Apr 19♈︎🔥1Tau 30°-60°Apr 20-May 20♉︎🌍2Gem 60°-90°May 21-Jun 20♊︎💨3Can 90°-120°Jun 21-Jul 22♋︎💧4Leo 120°-150°Jul 23-Aug 22♌︎🔥5Vir 150°-180°Aug 23-Sep 22♍︎🌍6Lib 180°-210°Sep 23-Oct 22♎︎💨7Sco 210°-240°Oct 23-Nov 21♏︎💧8Sag 240°-270°Nov 22-Dec 21♐︎🔥9Cap 270°-300°Dec 22-Jan 19♑︎🌍10Aqu 300°-330°Jan 20-Feb 18♒︎💨11Pis 330°-360°Feb 19-Mar 20♓︎💧1225.4°Me ^4.7°Su +17.3°Ve11.2°Mo8.5°Ju +27.4°Ke *13.1°Ma +12.6°Ra *13.1°Sa20.9°00:00:00
[ 02 / 06 ]
LIVEASTROLOGY · PLATFORM

AstroRattan

Vedic-astrology platform

A Vedic-astrology platform built on Swiss Ephemeris-based planetary calculations. Generates 16+ divisional charts (D1, D9, D10, D60…), runs Kundli matching with Ashtakoota scoring, computes Vimshottari and Yogini Dashas, supports Lal Kitab remedies, and serves Panchang and Muhurat queries from one engine. Customers get a live transit wheel, personalised reports, and a practitioner-friendly workspace. Live at astrorattan.com.

Charts

17

FLEET: 18 · MOVING: 12 · YIELDING: 4 · CHARGING: 2TASKS: 27 · REROUTES: 15SYSTEM · DIGITAL TWINL1A1A2L2A3S1FIELD · LIVE FLEETL1A1A2L2A3S1AGVLIFTERSCOUTDOCKROUTEL1 · LIFTERA1 · AGVA2 · AGVL2 · LIFTERA3 · AGVS1 · SCOUT
[ 03 / 06 ]
SIM-VALIDATEDROBOTICS · DIGITAL TWIN

WRIE

Warehouse Robotics Intelligence Engine

A sim-validated warehouse-robotics digital twin: one C++ control core designed to bind to either a Gazebo backend or a real fleet at the device-driver layer. The codebase implements 18 robot models from 5 vendors over a VDA5050-aligned message bus, 213+ contract-bound API endpoints, MAPF over the warehouse graph, ASRS lifts, teleoperation, pick stations, and a fleet-replay engine. Awaiting first live deployment — under concept review with a warehousing partner. Read the full case below.

Endpoints (built)

213+

io-gitaANCIENT TEXT · MODERN PHYSICSPRAKRITITHE LANDSCAPEGUNATHE FORCESGATITHE TRAJECTORYIO-GITA · DETERMINISTIC · ZERO HALLUCINATION
[ 04 / 06 ]
LIVEREASONING · RESEARCH

io-gita

A physics engine for the mind

A reasoning engine that treats moral and strategic dilemmas as force fields. Describe the situation in plain language; the engine extracts active forces (Dharma, Artha, Kama, Moksha and their tensions), maps them onto a deterministic field, and shows where the trajectory settles. The Bhagavad Gita is encoded as a structured rule-set, not a chatbot—outputs are designed to be reproducible, auditable, and explainable.

Outputs

Deterministic

CLIMATE IOT · MUSHROOM GROW-ROOMLIVE · CLOSED LOOPADMIN · DASHBOARDESP32SENSOR · NODECONTROLLERTEMP21°CRH88%CO₂1180GROW ROOM · STAGE · FRUITINGDISTRIBUTION · MARKETPLACE
[ 05 / 06 ]
LIVEAGRI-IOT

Mushroom Ki Mandi

Climate IoT + marketplace for mushroom farms

A closed-loop IoT system for commercial mushroom growers. ESP32 nodes inside the grow room stream temperature, humidity, and CO₂ to a controller that automates fans, humidifiers, and grow lights against the recipe for the current growth stage (spawn run → pinning → fruiting → harvest). The platform also runs a grower-to-buyer marketplace so the same growers who use the climate control sell straight to retailers. Marketing site live; sensor stack tested on the bench, no paying farms yet — opening one pilot grow room as the first reference site.

Site

Live

EAGLE TAILORS · DIGITAL ATELIERBESPOKE · MADE-TO-MEASUREDT=ONCUTTHREAD010203040506070EAGLE TAILORS · MEASURE · CUT · STITCH · DELIVER
[ 06 / 06 ]
LIVEBESPOKE TAILORING · RETAIL

Eagle Tailors

Heritage tailoring as a digital atelier

A century-old bespoke tailoring shop brought online without losing the craft. Customers book consultations, the digital atelier captures and archives every measurement, the fabric library lets clients browse mill-grade options, and the workshop tracks each garment through cut → stitch → finish → delivery. Every order is a contract: same workflow, same artisans, same standards - now searchable, repeatable, and visible to both shop and customer.

Garments

Tracked

[ 03 ] · Learnings Ledger

What every shipped system
has already taught the factory.

A scrolling excerpt — 15 selected entries from the internal CL-FAC ledger of 177 recorded learnings. Every failure mode gets captured, tagged, and fed back into the next build. The memory layer that prevents repeated mistakes.

+CL-FAC-010 - Factory bypassed its own tools.+CL-FAC-013 - Orchestrator bypassed its own review score target.+CL-FAC-053 - Silent error swallowing epidemic.+CL-FAC-094 - Claude declares 'complete' with backend only.+CL-FAC-100 - Claude claims 'fixed' without running verification.+CL-FAC-114 - Blueprint claims not enforced as tests.+CL-FAC-148 - Dead code subsystems - functions never called from main flow.+CL-FAC-095 - 142 tests pass but 5 critical bugs in code.+CL-FAC-149 - Contract shape mismatches: 11 fields don't match blueprint.+CL-FAC-046 - PMO Gate 10 too weak - checks existence not content.+CL-FAC-101 - Factory fixes symptoms not causes.+CL-FAC-163 - Security gates must fail closed not open.+CL-FAC-058 - No measurable success criteria defined.+CL-FAC-061 - No explicit OUT-OF-SCOPE list.+CL-FAC-159 - Test files must test what their name claims.+CL-FAC-010 - Factory bypassed its own tools.+CL-FAC-013 - Orchestrator bypassed its own review score target.+CL-FAC-053 - Silent error swallowing epidemic.+CL-FAC-094 - Claude declares 'complete' with backend only.+CL-FAC-100 - Claude claims 'fixed' without running verification.+CL-FAC-114 - Blueprint claims not enforced as tests.+CL-FAC-148 - Dead code subsystems - functions never called from main flow.+CL-FAC-095 - 142 tests pass but 5 critical bugs in code.+CL-FAC-149 - Contract shape mismatches: 11 fields don't match blueprint.+CL-FAC-046 - PMO Gate 10 too weak - checks existence not content.+CL-FAC-101 - Factory fixes symptoms not causes.+CL-FAC-163 - Security gates must fail closed not open.+CL-FAC-058 - No measurable success criteria defined.+CL-FAC-061 - No explicit OUT-OF-SCOPE list.+CL-FAC-159 - Test files must test what their name claims.
+CL-FAC-010 - Factory bypassed its own tools.+CL-FAC-013 - Orchestrator bypassed its own review score target.+CL-FAC-053 - Silent error swallowing epidemic.+CL-FAC-094 - Claude declares 'complete' with backend only.+CL-FAC-100 - Claude claims 'fixed' without running verification.+CL-FAC-114 - Blueprint claims not enforced as tests.+CL-FAC-148 - Dead code subsystems - functions never called from main flow.+CL-FAC-095 - 142 tests pass but 5 critical bugs in code.+CL-FAC-149 - Contract shape mismatches: 11 fields don't match blueprint.+CL-FAC-046 - PMO Gate 10 too weak - checks existence not content.+CL-FAC-101 - Factory fixes symptoms not causes.+CL-FAC-163 - Security gates must fail closed not open.+CL-FAC-058 - No measurable success criteria defined.+CL-FAC-061 - No explicit OUT-OF-SCOPE list.+CL-FAC-159 - Test files must test what their name claims.+CL-FAC-010 - Factory bypassed its own tools.+CL-FAC-013 - Orchestrator bypassed its own review score target.+CL-FAC-053 - Silent error swallowing epidemic.+CL-FAC-094 - Claude declares 'complete' with backend only.+CL-FAC-100 - Claude claims 'fixed' without running verification.+CL-FAC-114 - Blueprint claims not enforced as tests.+CL-FAC-148 - Dead code subsystems - functions never called from main flow.+CL-FAC-095 - 142 tests pass but 5 critical bugs in code.+CL-FAC-149 - Contract shape mismatches: 11 fields don't match blueprint.+CL-FAC-046 - PMO Gate 10 too weak - checks existence not content.+CL-FAC-101 - Factory fixes symptoms not causes.+CL-FAC-163 - Security gates must fail closed not open.+CL-FAC-058 - No measurable success criteria defined.+CL-FAC-061 - No explicit OUT-OF-SCOPE list.+CL-FAC-159 - Test files must test what their name claims.
+CL-FAC-010 - Factory bypassed its own tools.+CL-FAC-013 - Orchestrator bypassed its own review score target.+CL-FAC-053 - Silent error swallowing epidemic.+CL-FAC-094 - Claude declares 'complete' with backend only.+CL-FAC-100 - Claude claims 'fixed' without running verification.+CL-FAC-114 - Blueprint claims not enforced as tests.+CL-FAC-148 - Dead code subsystems - functions never called from main flow.+CL-FAC-095 - 142 tests pass but 5 critical bugs in code.+CL-FAC-149 - Contract shape mismatches: 11 fields don't match blueprint.+CL-FAC-046 - PMO Gate 10 too weak - checks existence not content.+CL-FAC-101 - Factory fixes symptoms not causes.+CL-FAC-163 - Security gates must fail closed not open.+CL-FAC-058 - No measurable success criteria defined.+CL-FAC-061 - No explicit OUT-OF-SCOPE list.+CL-FAC-159 - Test files must test what their name claims.+CL-FAC-010 - Factory bypassed its own tools.+CL-FAC-013 - Orchestrator bypassed its own review score target.+CL-FAC-053 - Silent error swallowing epidemic.+CL-FAC-094 - Claude declares 'complete' with backend only.+CL-FAC-100 - Claude claims 'fixed' without running verification.+CL-FAC-114 - Blueprint claims not enforced as tests.+CL-FAC-148 - Dead code subsystems - functions never called from main flow.+CL-FAC-095 - 142 tests pass but 5 critical bugs in code.+CL-FAC-149 - Contract shape mismatches: 11 fields don't match blueprint.+CL-FAC-046 - PMO Gate 10 too weak - checks existence not content.+CL-FAC-101 - Factory fixes symptoms not causes.+CL-FAC-163 - Security gates must fail closed not open.+CL-FAC-058 - No measurable success criteria defined.+CL-FAC-061 - No explicit OUT-OF-SCOPE list.+CL-FAC-159 - Test files must test what their name claims.

[ 04 ] · Field Reports

Two systems. Two outcomes.
Both still running.

FILE · S/01PRODUCTION
SIMULATORPHYSICAL FLEETSAME C++ BINARY · 15K LOC · 213 ENDPOINTS · VDA5050

SYSTEM · WRIE · Robotics digital twin

One C++ binary. Simulator and the real fleet.

The same 15K-LOC C++ control core runs inside the digital twin and on the physical robots. 213+ API endpoints, 18 robot models from 5 vendors, VDA5050-aligned messaging. Teleoperation, ASRS, and pick stations all share the same contract surface.

LOC (shared)

15K

Robot models

18

Endpoints

213+

FILE · S/02INTERNAL
PATCH · 0142 · AUTH.TS · DIFF +2 / −1AUDIT LIVEORCHCLAUDEauth.tsv0142 · 11 LINES01 export async function login(req, res) {02 const { email, password } = req.body;03 if (!email || !password) return 400;04 const user = await db.users.find(email);05+ const user = await db.users.find(06+ sanitise(email)); // kimi: input validation07 if (user?.pw !== password) return 401;08+ await rateLimit(req.ip); // gemini: rate limit09 const token = sign(user, 15 * 60);10 return res.json({ token });11 }QCKIMIinput → sanitisedARCHGEMINIrate-limit addedTWINCODEXship it.KIMIGEMINICODEXGATE → MAINAUTONOMOUS FACTORY · 4 ROLES · 23 INTERNAL · 5 LIVE · 177 LEARNINGS

SYSTEM · The Autonomous Factory · Multi-LLM

The factory audited its own code - and fixed it.

A multi-agent orchestration system where each LLM plays a defined role: orchestrator, quality auditor, architecture reviewer, and a digital twin of the owner. In internal runs, the system has been able to surface issues, raise them, and patch them with human review. 23 internal projects have moved through the factory; 5 have become public live sites; 1 (WRIE) is under concept review.

Internal builds

23

Public live sites

5

CL-FAC learnings

177

[ 05 ] · Operating Tenets

Six convictions
the studio doesn’t negotiate.

Driveline engineering taught me that real systems have hundreds of configurations and zero tolerance for ambiguity. The factory taught me that AI becomes useful only when you govern it like any engineering process.

T/01

Contract first.

Every build at AdaptiveMind begins with a blueprint document that defines capabilities, interfaces, and an explicit OUT-OF-SCOPE list before implementation starts. The blueprint is treated as a working contract between the problem owner and the delivery pipeline: success criteria carry numbers, state machines carry defined transitions, and assumptions are listed as risks rather than hopes. When agents author code against this document, there is less ambiguity about what shipped and why it might have deviated. The blueprint is versioned, diffed, and reviewed by an independent peer-model before a single test is written. This discipline helps avoid a common failure in software delivery—building the wrong thing correctly—by forcing intent to pass verification before execution begins. A blueprint that survives review becomes the specification against which gates, tests, and final artefacts are measured.

T/02

Test first.

Tests are written before the code they verify, not after the fact. In the AdaptiveMind pipeline, endpoints, state transitions, and contract boundaries have assertions defined before implementation arrives. The build swarm generates test cases from the blueprint, including property-based checks, boundary-condition probes, and soak scenarios that can run for thousands of iterations. If a test fails, the gate fails closed: the build stops, the drift is tagged with a CL-FAC code, and the failure mode is written into the Project Learning Ledger before a fix is attempted. This inversion—specification as oracle, implementation as candidate—reduces the confirmation bias that can creep into post-hoc testing. We treat a system as ready only after an independent verifier has exercised injected faults and confirmed recovery within the contract window.

T/03

Verify independently.

We try not to have the same agent both author and certify. At AdaptiveMind, each artefact that passes the build stage is submitted to an independent peer-model audit that checks the implementation against the blueprint (not against the code). This audit runs as a separate LLM instance with no access to the reasoning trace of the authoring agent, pushing it to verify behaviour from the contract surface alone. The auditor scores quality against declared success criteria, flags drift between intent and artefact, and issues a pass-or-block verdict before shipping. This separation mirrors a principle from safety-critical engineering: the same mind shouldn’t be both builder and inspector. The factory enforces it by architecture, not by policy, because institutional memory degrades when the same agent writes, tests, and certifies.

T/04

Capture & reuse.

Every regression at AdaptiveMind is treated as a signal, not a shame. When a gate fails—whether from contract drift, a security finding, or a boundary-condition miss—the failure mode is tagged with a CL-FAC code, documented in the Project Learning Ledger, and carried into the prompt context of the next build. The intent is to reduce repeat failures: a buffer-overflow found in the WRIE autonomy core can become a permanent test case for subsequent C++ builds, and a schema-mismatch caught in URIP can become a validation rule for future API designs. The ledger is the institutional memory of the operation, and it compounds when it’s kept honest. Over time, the hope is that projects ship with fewer avoidable surprises because prior failure modes are already encoded into gates and tests.

T/05

Three worlds, one thread.

Singh moves between driveline engineering, autonomous software delivery, and reasoning research as one practice. The discipline that keeps a VDA5050 message bus reliable at 100 Hz is similar to the discipline that reduces drift in a multi-LLM pipeline: explicit state machines, clear invariants, and a low tolerance for ambiguity. The WRIE warehouse graph, the URIP risk-scoring pipeline, and the io-gita force-field engine share a design lineage—state transitions are enumerated, failure modes have recovery paths, and assumptions are written down before they become bugs. The goal is to treat reliability as a property of the specification, not the domain, whether it’s a MAPF planner coordinating robots or an audit gate coordinating models.

T/06

Architecture, not magic.

AI becomes more useful when it is governed by the same engineering disciplines that govern bridges, drivelines, and medical devices: explicit contracts, independent verification, and accountable failure modes. The AdaptiveMind factory treats each LLM as a component in a larger system, not as an oracle. The orchestrator dispatches work, the architecture reviewer audits design against the blueprint, the quality gatekeeper scores output with numerical thresholds, and the digital twin reviews like the owner—each role is constrained by rules, prompt templates, and success criteria that predate the run. When a model drifts, the gate should catch it and write a CL-FAC entry. When a contract breaks, tests fail closed. It’s less magic than systems engineering, and systems can be inspected, versioned, and improved over time.

[ Now ] · On the bench

Currently in motion.

What’s on the bench right now - projects past prototype and short of ship. Updated monthly. None of these are committed roadmap; they are the things the studio is actively spending hours on.

N/01

Active research

io-gita · applied research

Extending the io-gita SG-Engine - the topological reasoning core that ships in io-gita.com - into a decision router for production incidents. Project 22 maps every state transition into a directed graph at α-density 36.6%, routes multi-hop plans through reachable basins, and refuses paths that don't exist. Current test bed: incident-replay traces from WRIE and URIP. Next: bridge synthesis (validate-before-apply) on a real warehouse decision dataset.

Project 22 · sg_engine · cross-cutting

N/02

Phase 1 shipped · 126 / 126 tests

PPF Workshop Monitoring · Phase 2

ESP32 sensors (DHT22, BME680, PMS5003) stream temperature, humidity, and particulate counts from each paint-protection bay over MQTT into a FastAPI / PostgreSQL backend. React dashboard shows real-time bay state. Hikvision RTSP cameras pipe live video into MediaMTX with WebRTC playback. Hardware kit at ₹16,250 per pit + ₹1,500 / pit / month. Phase 2 adds the customer-facing public job tracker and the audit log on every alert. Next: field trial across three workshops.

Project 21 · IoT · ESP32 + FastAPI + Hikvision

N/03

Prototype running locally

URIP · Personal

URIP's enterprise risk-intelligence pipeline scaled down to a one-engineer stack. FastAPI + React + SQLite. Five scanners wired in: Nuclei (websites), Lynis (workstation), holehe (email-breach), subfinder (subdomain enumeration), pip-audit (Python dependencies). Same composite-score formula as the enterprise version - 0.55·CVSS + 2.5·EPSS + KEV bonus + asset bonus. First asset scored at 64 / 100. Goal: a single dashboard for the security posture of one engineer's whole digital footprint.

Project 34 · URIP-derived · personal

Updated April 2026 · cadence: monthly

[ 06 ] · Engagement

Bring me a problem
that ought to have been solved
last year.

Open for selective engagements across applied AI, autonomous systems, production IoT, and high-stakes production software. Replies within forty-eight hours - yes, no, or a sharper question.

- Engagement type

We read every message. Typical reply within 48 hours.

- Response window

≤ 48 hrs

Replies, every channel

- Capacity

02 - 03

First pilots · By application

- Channels