Design with CARE. · AI Agents for Science

Authors: Rahul Ramachandran, Nidhi Jha, Muthukumaran Ramasubramanian

Collaborative Agent Reasoning Engineering (CARE) is a disciplined, stage-gated methodology designed to systematically engineer AI agents for scientific and technical workflows. Influenced by the vision of Accelerated Knowledge Discovery (AKD), CARE moves away from ad hoc “prompt tinkering” toward a structured engineering process centered on reusable design artifacts and human-in-the-loop oversight.

The Core Methodology: Triadic Collaboration

CARE is defined by a three-party workflow that ensures scientific integrity and technical feasibility:

Role	Responsibility
Subject Matter Experts (SMEs)	Provide domain authority, surfacing nuanced constraints and validating scientific “truth”
Developers	Act as the implementation authority, ensuring tool realism and feasibility
Helper LLM Agents	Serve as “facilitation infrastructure”, accelerating the process by asking phase-aligned questions, drafting specifications in Markdown, and proposing revisions for human approval

Five Phases of Development

The methodology is organized into five distinct phases, each requiring joint approval at a stage gate before proceeding:

Scope & Decompose, Defines the target workflow, users, and constraints.
Key Information Elicitation, Captures details on tools, domain context, and required output formats.
Reasoning Policy & Guardrails, Codifies “expert-like” thinking logic and safety boundaries for uncertainty or tool errors.
Implementation, Translates approved artifacts into an engineered agent prompt using established design patterns.
Benchmarking & Verification, Establishes realistic query sets and scoring rubrics to detect regressions over time.

Key Design Targets

CARE deconstructs agent quality into four interacting targets to prevent “silent failures” where outputs look plausible but violate domain constraints or provenance expectations:

Target	Description
Interaction Policy	How the agent decomposes tasks and manages uncertainty.
Domain Grounding	Defines authoritative knowledge boundaries to reduce “plausible-but-wrong” outputs.
Tool Orchestration	Defines which tools to use and how to handle errors or retries.
Evaluation and Verification	Defines user-centric success criteria on realistic, complex tasks.

CARE GitHub

NASA-IMPACT/AKD-CARE (private)

CARE Papers

NASA Technical Reports, citation 20260000926