Vision for AI-Driven Development
This document outlines the philosophy underlying AI agent architecture (MCP, Skills, and Agent integration) and the fundamental approach to AI-driven development.
Audience: Engineers interested in AI-driven development. Whether you're a practitioner evaluating MCP/Skills adoption or a decision-maker considering team-wide integration, this document provides a foundational perspective.
Position of This Page
👉 This page (WHY — Why we need authoritative references)
- 02-reference-sources (WHAT — What to use as references)
- 03-architecture (HOW — How to structure the system)
- 04-ai-design-patterns (WHICH — Which pattern to choose and when)
- 05-solving-ai-limitations (REALITY — How to face real-world constraints)
- 06-physical-ai (EXTENSION — Extending the three-layer model to the physical world)
- 07-doctrine-and-intent (DOCTRINE — On what basis should AI judge and act?)
Meta Information
| What this chapter establishes | AI's four fundamental limitations (accuracy, currency, authority, accountability) and the need for "authoritative references" |
| What this chapter does NOT cover | Specific reference source selection (→02), architecture design (→03), implementation techniques (→04) |
| Dependencies | None (starting point of the Concepts section) |
| Common misuse | Concluding "AI is unusable." This chapter's argument is "recognize the constraints, then structurally compensate" |
Core Understanding
AI is "Not Omnipotent"
While AI capabilities are rapidly advancing, it is crucial to correctly recognize their limitations. To avoid over-reliance on AI and use it appropriately, we need to understand the following constraints.
AI generates outputs probabilistically from training data, but cannot guarantee the following:
| AI Limitation | Description |
|---|---|
| Accuracy | Hallucination problem - may generate information that differs from facts |
| Currency | Does not have information beyond the training data cutoff |
| Authority | Cannot guarantee official interpretation of specifications |
| Accountability | Cannot provide grounds for legal or ethical judgments |
Therefore, we need to connect to reliable sources.
The Essence of AI-Driven Development
AI-driven development ≠ Having AI write code
AI-driven development = Utilizing AI throughout all processes while humans retain judgment and creativityWhat "All Processes" Means
This is not limited to coding. It refers to the broad range of management domains involved in software development — including project management, product management, SRE, security, and data management — as outlined in Management of Software Systems and Services.
However, realizing this ideal requires a prerequisite. For AI to be truly useful "across all processes," it must be placed in an environment where it can make correct judgments.
The Reality During This Transitional Period
The era where AI autonomously completes every process has not yet arrived. AI excels at "generating plausible output," but it cannot judge on its own whether that output is correct.
At the same time, the code AI generates today depends on abstraction layers — frameworks and libraries — whose foundations are the standards and specifications that humanity has accumulated over time.
Although the code AI generates is built on top of these standards, AI cannot directly reference them (dashed line). This is the core problem.
For AI to function correctly, it must be able to reference the same standards and specifications that its generated code ultimately depends on. This is why "unwavering reference sources" are necessary.
The Importance of "Unwavering Reference Sources"
Why Reference Sources Are Needed
| AI Challenge | What Reference Sources Solve |
|---|---|
| Fixed point-in-time training data | Access to authoritative up-to-date sources |
| Hallucination | Provision of verifiable evidence |
| Interpretation variance by context | Consistent decision criteria |
| Lack of latest information | Retrieval of current specifications |
Two Means to Achieve "Unwavering Reference Sources"
MCP and Skills serve as means to provide AI with "unwavering reference sources."
| Means | Role | Examples |
|---|---|---|
| MCP | Dynamic access to external authoritative sources | RFC, legislation, W3C standards |
| Skills | Systematization of domain knowledge and best practices | Design principles, workflows, coding standards |
A note on "Skills" terminology
In this document, "Skills" refers to Markdown-based systematization of domain knowledge, following the format defined by vercel-labs/skills. Unlike OpenAI's "Actions" or LangChain's "Tools," Skills are not executable code — they are structured knowledge and judgment criteria that AI references.
Essential Definition of "Unwavering Reference Sources"
An "unwavering reference source" is a fact retrieved from a verifiable information source, not an LLM's speculation.
Based on this definition, reference sources can be classified into two types:
| Type | Characteristics | Examples | Verification Method |
|---|---|---|---|
| Static Reference | Content is fixed and immutable | RFCs, legislation, W3C specs | Version / section number |
| Dynamic Reference | Values change, but are factual at the time of retrieval | Sensors, APIs, real-time data | Timestamp + data source ID |
Both share the property of "not being speculatively generated by an LLM." Dynamic references require separate verification of the data source's authenticity, but retrieving them via MCP ensures clear provenance.
Value of Reference MCP/Skills
- AI decisions become verifiable - Can demonstrate the basis for outputs
- Consistent quality is supported - Standards-aligned outputs
- Vendor lock-in is avoided - Based on open standards
- Access to knowledge is democratized - Reach accurate information without being an expert
- Domain knowledge becomes reusable - Formalize team know-how as Skills
Democratization of Knowledge
Problems with the Traditional Approach
- High cost
- One-way
- Language barriers
The World MCP/Skills Enables
Development based on accurate information becomes possible without relying on expensive consultants or specialists.
For how to distinguish between MCP and Skills, see skills/vs-mcp.md.
Three Axes of Knowledge Transformation
Knowledge transformation in AI-driven development is not one-directional. This architecture defines the following three transformation axes.
| Axis | Direction | Purpose | Example |
|---|---|---|---|
| ① Structuring | Human → AI | Transform authoritative sources into AI-accessible formats | RFC → rfcxml-mcp |
| ② Comprehension | AI → Human | Transform complex information into understandable formats | RFC 3161 → Checklist |
| ③ Verification | Spec → Test | Convert specifications into verifiable criteria | EPUB 3.3 requirements → JSON test suite |
Axis ③ "Verification" differs from ① and ② in that it forms a quality closed loop. Simply passing specifications to AI does not make its output verifiable — converting specifications into tests is what makes "driving" possible.
Human → AI (Structuring) Knowledge Transformation
Enable AI to access "unwavering reference sources."
Structuring External Information Sources via MCP
| Human Knowledge | Structured Format | AI-Usable Form |
|---|---|---|
| Legal text | e-Gov API | hourei-mcp |
| Technical specifications | RFC XML | rfcxml-mcp |
| Web standards | W3C/WHATWG | w3c-mcp |
| Translation rules | Glossary | DeepL Glossary |
Systematizing Domain Knowledge via Skills
| Team Knowledge | Format | AI-Usable Form |
|---|---|---|
| Design principles | Markdown | frontend-design skill |
| Coding standards | Markdown | coding-standards skill |
| Workflows | Markdown | doc-coauthoring skill |
AI → Human (Comprehension Support) Knowledge Transformation
Enable humans to access accurate knowledge even without being specialists.
| Complex Information Source | AI Processing | Human-Understandable Form |
|---|---|---|
| RFC 3161 (135 requirements) | Extraction/Classification | Checklist |
| Digital Signature Law + RFC | Mapping | Correspondence table |
| Technical specifications | Visualization | Mermaid diagrams |
| English RFCs | Translation | Explanations in local language |
Division of Roles Between Humans and AI
The Responsibility Shift Model
As abstraction rises, the responsibilities of accuracy, reliability, and judgment do not disappear — they shift who holds them.
| Responsibility Phase | Owner | Scope | Verification Mechanism |
|---|---|---|---|
| Design-time | Human | Selection of reference sources, structural design, defining judgment criteria | Spec-to-Test conversion |
| Execution-time | Agent | Reasoning and task execution based on reference sources | Evaluation pipeline (probabilistic quality gate) |
| Structural constraints | System | Consistency of references, access control, audit trails | Guardrails (inviolable constraints) |
If these responsibility boundaries remain ambiguous as abstraction increases, a situation arises where "no one is accountable." This architecture aims to make these boundaries explicit at the design level, using two verification layers:
| Verification Layer | Nature | Example Criteria | Role |
|---|---|---|---|
| Guardrails | Inviolable (boundary-based) | ESLint errors = 0, type checks pass | Defines "lines that must not be crossed" |
| Evaluation Pipeline | Probabilistic (threshold-based) | xCOMET >= 0.85, test coverage >= 80% | Defines "acceptable ranges" |
Traditional TDD (Red-Green-Refactor) does not apply directly, but test-first thinking — converting specifications into verifiable forms before implementation — remains fundamentally valuable in AI-driven development.
Complementary Structure
The diagram below illustrates how human capabilities and AI capabilities complement each other to achieve better development outcomes.
Basic flow of MCP, Skills, and Agent
Here is the fundamental flow that shows how user input flows through the agent core and tool integrations to produce results. For the governance layer (judgment criteria, constraints, and objectives) that governs all these layers, see the Doctrine Layer.
Positioning of This Repository
This repository is a place to organize the design philosophy, architecture, and practical know-how of AI agent architecture (MCP, Skills, and Agent integration), and to document strategies for building "unwavering reference sources" as the foundation of AI-driven development.
What This Document Does Not Guarantee (Non-goals)
To clarify the scope of this document, we state the following explicitly.
| What this document claims | What this document does NOT guarantee | What it provides instead |
|---|---|---|
| Providing verifiable reference sources improves AI judgment quality | Factual correctness of all AI outputs | Spec-to-Test verification pipeline |
| MCP/Skills provide structural constraints | Elimination of human review | Two-layer structure: guardrails (inviolable constraints) + evaluation pipeline (probabilistic quality gate) |
| The design aims to clarify responsibility allocation | That the system assumes legal or ethical liability | Design-time responsibility boundaries + runtime audit trails |
Reader contract: This document presents a design philosophy for "how to place AI in a trustworthy environment." It does not promise specific quality levels or safety guarantees. However, rather than leaving outputs unverifiable, it adopts an approach of converting specifications into verifiable tests and ensuring quality through a two-layer system of guardrails and evaluation pipelines. Final judgment and accountability always remain with humans.
Core Messages
- AI-driven development is not just code generation - Utilize AI throughout all processes
- AI needs guidelines for decision-making - The importance of unwavering reference sources
- Systematize human engineering knowledge - Formalize as MCP/Skills
- Standards-based MCPs are the foundation - Democratize access to RFC, W3C, legislation, etc.
- Share domain knowledge via Skills - Make team know-how reusable
- Bidirectional knowledge transformation - Human→AI (structuring), AI→Human (comprehension support)
- Explicit judgment criteria - Define constraints, objectives, and judgment criteria via the Doctrine Layer to enable autonomous AI decision-making