Improved Biomni support

2026-01-26 16:58:56 +08:00 · 2025-10-22 08:38:06 -07:00
parent 71a3c3750f
commit 77822efeed
9 changed files with 2512 additions and 3393 deletions
--- a/scientific-packages/biomni/SKILL.md
+++ b/scientific-packages/biomni/SKILL.md
@@ -1,375 +1,316 @@
 ---
 name: biomni
-description: "AI agent for autonomous biomedical task execution. CRISPR design, scRNA-seq, ADMET, GWAS, structure prediction, multi-omics, with automated planning/code generation, for complex workflows."
+description: Autonomous biomedical AI agent framework for executing complex research tasks across genomics, drug discovery, molecular biology, and clinical analysis. Use this skill when conducting multi-step biomedical research including CRISPR screening design, single-cell RNA-seq analysis, ADMET prediction, GWAS interpretation, rare disease diagnosis, or lab protocol optimization. Leverages LLM reasoning with code execution and integrated biomedical databases.
 ---

 # Biomni

 ## Overview

-Biomni is a general-purpose biomedical AI agent that autonomously executes research tasks across diverse biomedical subfields. Use Biomni to combine large language model reasoning with retrieval-augmented planning and code-based execution for scientific productivity and hypothesis generation. The system operates with an ~11GB biomedical knowledge base covering molecular, genomic, and clinical domains.
+Biomni is an open-source biomedical AI agent framework from Stanford's SNAP lab that autonomously executes complex research tasks across biomedical domains. Use this skill when working on multi-step biological reasoning tasks, analyzing biomedical data, or conducting research spanning genomics, drug discovery, molecular biology, and clinical analysis.
+
+## Core Capabilities
+
+Biomni excels at:
+
+1. **Multi-step biological reasoning** - Autonomous task decomposition and planning for complex biomedical queries
+2. **Code generation and execution** - Dynamic analysis pipeline creation for data processing
+3. **Knowledge retrieval** - Access to ~11GB of integrated biomedical databases and literature
+4. **Cross-domain problem solving** - Unified interface for genomics, proteomics, drug discovery, and clinical tasks
+
+## When to Use This Skill
+
+Use biomni for:
+- **CRISPR screening** - Design screens, prioritize genes, analyze knockout effects
+- **Single-cell RNA-seq** - Cell type annotation, differential expression, trajectory analysis
+- **Drug discovery** - ADMET prediction, target identification, compound optimization
+- **GWAS analysis** - Variant interpretation, causal gene identification, pathway enrichment
+- **Clinical genomics** - Rare disease diagnosis, variant pathogenicity, phenotype-genotype mapping
+- **Lab protocols** - Protocol optimization, literature synthesis, experimental design

 ## Quick Start

-Initialize and use the Biomni agent with these basic steps:
+### Installation and Setup
+
+Biomni requires conda environment setup and API keys for LLM providers:
+
+```bash
+# Clone repository and set up environment
+git clone https://github.com/snap-stanford/biomni
+cd biomni
+bash setup.sh
+
+# Or install via pip
+conda activate biomni_e1
+pip install biomni --upgrade
+```
+
+Configure API keys (store in `.env` file or environment variables):
+```bash
+export ANTHROPIC_API_KEY="your-key-here"
+# Optional: OpenAI, Azure, Google, Groq, AWS Bedrock keys
+```
+
+Use `scripts/setup_environment.py` for interactive setup assistance.
+
+### Basic Usage Pattern

 ```python
 from biomni.agent import A1

-# Initialize agent with data path and LLM model
+# Initialize agent with data path and LLM choice
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')

-# Execute a biomedical research task
-agent.go("Your biomedical task description")
+# Execute biomedical task autonomously
+agent.go("Your biomedical research question or task")
+
+# Save conversation history and results
+agent.save_conversation_history("report.pdf")
 ```

-The agent will autonomously decompose the task, retrieve relevant biomedical knowledge, generate and execute code, and provide results.
+## Working with Biomni

-## Installation and Setup
+### 1. Agent Initialization

-### Environment Preparation
-
-1. **Set up the conda environment:**
-   - Follow instructions in `biomni_env/README.md` from the repository
-   - Activate the environment: `conda activate biomni_e1`
-
-2. **Install the package:**
-   ```bash
-   pip install biomni --upgrade
-   ```
-
-   Or install from source:
-   ```bash
-   git clone https://github.com/snap-stanford/biomni.git
-   cd biomni
-   pip install -e .
-   ```
-
-3. **Configure API keys:**
-
-   Set up credentials via environment variables or `.env` file:
-   ```bash
-   export ANTHROPIC_API_KEY="your-key-here"
-   export OPENAI_API_KEY="your-key-here"  # Optional
-   ```
-
-4. **Data initialization:**
-
-   On first use, the agent will automatically download the ~11GB biomedical knowledge base.
-
-### LLM Provider Configuration
-
-Biomni supports multiple LLM providers. Configure the default provider using:
+The A1 class is the primary interface for biomni:

 ```python
+from biomni.agent import A1
 from biomni.config import default_config

-# Set the default LLM model
-default_config.llm = "claude-sonnet-4-20250514"  # Anthropic
-# default_config.llm = "gpt-4"  # OpenAI
-# default_config.llm = "azure/gpt-4"  # Azure OpenAI
-# default_config.llm = "gemini/gemini-pro"  # Google Gemini
-
-# Set timeout (optional)
-default_config.timeout_seconds = 1200
-
-# Set data path (optional)
-default_config.data_path = "./custom/data/path"
-```
-
-Refer to `references/llm_providers.md` for detailed configuration options for each provider.
-
-## Core Biomedical Research Tasks
-
-### 1. CRISPR Screening and Design
-
-Execute CRISPR screening tasks including guide RNA design, off-target analysis, and screening experiment planning:
-
-```python
-agent.go("Design a CRISPR screening experiment to identify genes involved in cancer cell resistance to drug X")
-```
-
-The agent will:
- Retrieve relevant gene databases
- Design guide RNAs with specificity analysis
- Plan experimental controls and readout strategies
- Generate analysis code for screening results
-
-### 2. Single-Cell RNA-seq Analysis
-
-Perform comprehensive scRNA-seq analysis workflows:
-
-```python
-agent.go("Analyze this 10X Genomics scRNA-seq dataset, identify cell types, and find differentially expressed genes between clusters")
-```
-
-Capabilities include:
- Quality control and preprocessing
- Dimensionality reduction and clustering
- Cell type annotation using marker databases
- Differential expression analysis
- Pathway enrichment analysis
-
-### 3. Molecular Property Prediction (ADMET)
-
-Predict absorption, distribution, metabolism, excretion, and toxicity properties:
-
-```python
-agent.go("Predict ADMET properties for these drug candidates: [SMILES strings]")
-```
-
-The agent handles:
- Molecular descriptor calculation
- Property prediction using integrated models
- Toxicity screening
- Drug-likeness assessment
-
-### 4. Genomic Analysis
-
-Execute genomic data analysis tasks:
-
-```python
-agent.go("Perform GWAS analysis to identify SNPs associated with disease phenotype in this cohort")
-```
-
-Supports:
- Genome-wide association studies (GWAS)
- Variant calling and annotation
- Population genetics analysis
- Functional genomics integration
-
-### 5. Protein Structure and Function
-
-Analyze protein sequences and structures:
-
-```python
-agent.go("Predict the structure of this protein sequence and identify potential binding sites")
-```
-
-Capabilities:
- Sequence analysis and domain identification
- Structure prediction integration
- Binding site prediction
- Protein-protein interaction analysis
-
-### 6. Disease Diagnosis and Classification
-
-Perform disease classification from multi-omics data:
-
-```python
-agent.go("Build a classifier to diagnose disease X from patient RNA-seq and clinical data")
-```
-
-### 7. Systems Biology and Pathway Analysis
-
-Analyze biological pathways and networks:
-
-```python
-agent.go("Identify dysregulated pathways in this differential expression dataset")
-```
-
-### 8. Drug Discovery and Repurposing
-
-Support drug discovery workflows:
-
-```python
-agent.go("Identify FDA-approved drugs that could be repurposed for treating disease Y based on mechanism of action")
-```
-
-## Advanced Features
-
-### Custom Configuration per Agent
-
-Override global configuration for specific agent instances:
-
-```python
+# Basic initialization
 agent = A1(
-    path='./project_data',
-    llm='gpt-4o',
-    timeout=1800
+    path='./data',  # Path to data lake (~11GB downloaded on first use)
+    llm='claude-sonnet-4-20250514'  # LLM model selection
 )
+
+# Advanced configuration
+default_config.llm = "gpt-4"
+default_config.timeout_seconds = 1200
+default_config.max_iterations = 50
 ```

-### Conversation History and Reporting
+**Supported LLM Providers:**
+- Anthropic Claude (recommended): `claude-sonnet-4-20250514`, `claude-opus-4-20250514`
+- OpenAI: `gpt-4`, `gpt-4-turbo`
+- Azure OpenAI: via Azure configuration
+- Google Gemini: `gemini-2.0-flash-exp`
+- Groq: `llama-3.3-70b-versatile`
+- AWS Bedrock: Various models via Bedrock API

-Save execution traces as formatted PDF reports:
+See `references/llm_providers.md` for detailed LLM configuration instructions.
+
+### 2. Task Execution Workflow
+
+Biomni follows an autonomous agent workflow:

 ```python
-# After executing tasks
-agent.save_conversation_history(
-    output_path='./reports/experiment_log.pdf',
-    format='pdf'
+# Step 1: Initialize agent
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+# Step 2: Execute task with natural language query
+result = agent.go("""
+Design a CRISPR screen to identify genes regulating autophagy in
+HEK293 cells. Prioritize genes based on essentiality and pathway
+relevance.
+""")
+
+# Step 3: Review generated code and analysis
+# Agent autonomously:
+# - Decomposes task into sub-steps
+# - Retrieves relevant biological knowledge
+# - Generates and executes analysis code
+# - Interprets results and provides insights
+
+# Step 4: Save results
+agent.save_conversation_history("autophagy_screen_report.pdf")
+```
+
+### 3. Common Task Patterns
+
+#### CRISPR Screening Design
+```python
+agent.go("""
+Design a genome-wide CRISPR knockout screen for identifying genes
+affecting [phenotype] in [cell type]. Include:
+1. sgRNA library design
+2. Gene prioritization criteria
+3. Expected hit genes based on pathway analysis
+""")
+```
+
+#### Single-Cell RNA-seq Analysis
+```python
+agent.go("""
+Analyze this single-cell RNA-seq dataset:
+- Perform quality control and filtering
+- Identify cell populations via clustering
+- Annotate cell types using marker genes
+- Conduct differential expression between conditions
+File path: [path/to/data.h5ad]
+""")
+```
+
+#### Drug ADMET Prediction
+```python
+agent.go("""
+Predict ADMET properties for these drug candidates:
+[SMILES strings or compound IDs]
+Focus on:
+- Absorption (Caco-2 permeability, HIA)
+- Distribution (plasma protein binding, BBB penetration)
+- Metabolism (CYP450 interaction)
+- Excretion (clearance)
+- Toxicity (hERG liability, hepatotoxicity)
+""")
+```
+
+#### GWAS Variant Interpretation
+```python
+agent.go("""
+Interpret GWAS results for [trait/disease]:
+- Identify genome-wide significant variants
+- Map variants to causal genes
+- Perform pathway enrichment analysis
+- Predict functional consequences
+Summary statistics file: [path/to/gwas_summary.txt]
+""")
+```
+
+See `references/use_cases.md` for comprehensive task examples across all biomedical domains.
+
+### 4. Data Integration
+
+Biomni integrates ~11GB of biomedical knowledge sources:
+- **Gene databases** - Ensembl, NCBI Gene, UniProt
+- **Protein structures** - PDB, AlphaFold
+- **Clinical datasets** - ClinVar, OMIM, HPO
+- **Literature indices** - PubMed abstracts, biomedical ontologies
+- **Pathway databases** - KEGG, Reactome, GO
+
+Data is automatically downloaded to the specified `path` on first use.
+
+### 5. MCP Server Integration
+
+Extend biomni with external tools via Model Context Protocol:
+
+```python
+# MCP servers can provide:
+# - FDA drug databases
+# - Web search for literature
+# - Custom biomedical APIs
+# - Laboratory equipment interfaces
+
+# Configure MCP servers in .biomni/mcp_config.json
+```
+
+### 6. Evaluation Framework
+
+Benchmark agent performance on biomedical tasks:
+
+```python
+from biomni.eval import BiomniEval1
+
+evaluator = BiomniEval1()
+
+# Evaluate on specific task types
+score = evaluator.evaluate(
+    task_type='crispr_design',
+    instance_id='test_001',
+    answer=agent_output
 )
+
+# Access evaluation dataset
+dataset = evaluator.load_dataset()
 ```

-Requires one of: WeasyPrint, markdown2pdf, or Pandoc.
-
-### Model Context Protocol (MCP) Integration
-
-Extend agent capabilities with external tools:
-
-```python
-# Add MCP-compatible tools
-agent.add_mcp(config_path='./mcp_config.json')
-```
-
-MCP enables integration with:
- Laboratory information management systems (LIMS)
- Specialized bioinformatics databases
- Custom analysis pipelines
- External computational resources
-
-### Using Biomni-R0 (Specialized Reasoning Model)
-
-Deploy the 32B parameter Biomni-R0 model for enhanced biological reasoning:
-
-```bash
-# Install SGLang
-pip install "sglang[all]"
-
-# Deploy Biomni-R0
-python -m sglang.launch_server \
-    --model-path snap-stanford/biomni-r0 \
-    --port 30000 \
-    --trust-remote-code
-```
-
-Then configure the agent:
-
-```python
-from biomni.config import default_config
-
-default_config.llm = "openai/biomni-r0"
-default_config.api_base = "http://localhost:30000/v1"
-```
-
-Biomni-R0 provides specialized reasoning for:
- Complex multi-step biological workflows
- Hypothesis generation and evaluation
- Experimental design optimization
- Literature-informed analysis
-
 ## Best Practices

-### Task Specification
-
-Provide clear, specific task descriptions:
-
-✅ **Good:** "Analyze this scRNA-seq dataset (file: data.h5ad) to identify T cell subtypes, then perform differential expression analysis comparing activated vs. resting T cells"
-
-❌ **Vague:** "Analyze my RNA-seq data"
-
-### Data Organization
-
-Structure data directories for efficient retrieval:
-
-```
-project/
-├── data/              # Biomni knowledge base
-├── raw_data/          # Your experimental data
-├── results/           # Analysis outputs
-└── reports/           # Generated reports
-```
-
-### Iterative Refinement
-
-Use iterative task execution for complex analyses:
-
-```python
-# Step 1: Exploratory analysis
-agent.go("Load and perform initial QC on the proteomics dataset")
-
-# Step 2: Based on results, refine analysis
-agent.go("Based on the QC results, remove low-quality samples and normalize using method X")
-
-# Step 3: Downstream analysis
-agent.go("Perform differential abundance analysis with adjusted parameters")
-```
+### Task Formulation
+- **Be specific** - Include biological context, organism, cell type, conditions
+- **Specify outputs** - Clearly state desired analysis outputs and formats
+- **Provide data paths** - Include file paths for datasets to analyze
+- **Set constraints** - Mention time/computational limits if relevant

 ### Security Considerations
+⚠️ **Important**: Biomni executes LLM-generated code with full system privileges. For production use:
+- Run in isolated environments (Docker, VMs)
+- Avoid exposing sensitive credentials
+- Review generated code before execution in sensitive contexts
+- Use sandboxed execution environments when possible

-**CRITICAL:** Biomni executes LLM-generated code with full system privileges. For production use:
+### Performance Optimization
+- **Choose appropriate LLMs** - Claude Sonnet 4 recommended for balance of speed/quality
+- **Set reasonable timeouts** - Adjust `default_config.timeout_seconds` for complex tasks
+- **Monitor iterations** - Track `max_iterations` to prevent runaway loops
+- **Cache data** - Reuse downloaded data lake across sessions

-1. **Use sandboxed environments:** Deploy in Docker containers or VMs with restricted permissions
-2. **Validate sensitive operations:** Review code before execution for file access, network calls, or credential usage
-3. **Limit data access:** Restrict agent access to only necessary data directories
-4. **Monitor execution:** Log all executed code for audit trails
+### Result Documentation
+```python
+# Always save conversation history for reproducibility
+agent.save_conversation_history("results/project_name_YYYYMMDD.pdf")

-Never run Biomni with:
- Unrestricted file system access
- Direct access to sensitive credentials
- Network access to production systems
- Elevated system privileges
+# Include in reports:
+# - Original task description
+# - Generated analysis code
+# - Results and interpretations
+# - Data sources used
+```

-### Model Selection Guidelines
+## Resources

-Choose models based on task complexity:
+### References
+Detailed documentation available in the `references/` directory:

- **Claude Sonnet 4:** Recommended for most biomedical tasks, excellent biological reasoning
- **GPT-4/GPT-4o:** Strong general capabilities, good for diverse tasks
- **Biomni-R0:** Specialized for complex biological reasoning, multi-step workflows
- **Smaller models:** Use for simple, well-defined tasks to reduce cost
+- **`api_reference.md`** - Complete API documentation for A1 class, configuration, and evaluation
+- **`llm_providers.md`** - LLM provider setup (Anthropic, OpenAI, Azure, Google, Groq, AWS)
+- **`use_cases.md`** - Comprehensive task examples for all biomedical domains

-## Evaluation and Benchmarking
+### Scripts
+Helper scripts in the `scripts/` directory:

-Biomni-Eval1 benchmark contains 433 evaluation instances across 10 biological tasks:
+- **`setup_environment.py`** - Interactive environment and API key configuration
+- **`generate_report.py`** - Enhanced PDF report generation with custom formatting

- GWAS analysis
- Disease diagnosis
- Gene detection and classification
- Molecular property prediction
- Pathway analysis
- Protein function prediction
- Drug response prediction
- Variant interpretation
- Cell type annotation
- Biomarker discovery
-
-Use the benchmark to:
- Evaluate custom agent configurations
- Compare LLM providers for specific tasks
- Validate analysis pipelines
+### External Resources
+- **GitHub**: https://github.com/snap-stanford/biomni
+- **Web Platform**: https://biomni.stanford.edu
+- **Paper**: https://www.biorxiv.org/content/10.1101/2025.05.30.656746v1
+- **Model**: https://huggingface.co/biomni/Biomni-R0-32B-Preview
+- **Evaluation Dataset**: https://huggingface.co/datasets/biomni/Eval1

 ## Troubleshooting

 ### Common Issues

-**Issue:** Data download fails or times out
-**Solution:** Manually download the knowledge base or increase timeout settings
+**Data download fails**
+```python
+# Manually trigger data lake download
+agent = A1(path='./data', llm='your-llm')
+# First .go() call will download data
+```

-**Issue:** Package dependency conflicts
-**Solution:** Some optional dependencies cannot be installed by default due to conflicts. Install specific packages manually and uncomment relevant code sections as documented in the repository
+**API key errors**
+```bash
+# Verify environment variables
+echo $ANTHROPIC_API_KEY
+# Or check .env file in working directory
+```

-**Issue:** LLM API errors
-**Solution:** Verify API key configuration, check rate limits, ensure sufficient credits
+**Timeout on complex tasks**
+```python
+from biomni.config import default_config
+default_config.timeout_seconds = 3600  # 1 hour
+```

-**Issue:** Memory errors with large datasets
-**Solution:** Process data in chunks, use data subsampling, or deploy on higher-memory instances
+**Memory issues with large datasets**
+- Use streaming for large files
+- Process data in chunks
+- Increase system memory allocation

 ### Getting Help

-For detailed troubleshooting:
- Review the Biomni GitHub repository issues
- Check `references/api_reference.md` for detailed API documentation
- Consult `references/task_examples.md` for comprehensive task patterns
-
-## Resources
-
-### references/
-Detailed reference documentation for advanced usage:
-
- **api_reference.md:** Complete API documentation for A1 agent, configuration objects, and utility functions
- **llm_providers.md:** Comprehensive guide for configuring all supported LLM providers (Anthropic, OpenAI, Azure, Gemini, Groq, Ollama, AWS Bedrock)
- **task_examples.md:** Extensive collection of biomedical task examples with code patterns
-
-### scripts/
-Helper scripts for common operations:
-
- **setup_environment.py:** Automated environment setup and validation
- **generate_report.py:** Enhanced PDF report generation with custom formatting
-
-Load reference documentation as needed:
-```python
-# Claude can read reference files when needed for detailed information
-# Example: "Check references/llm_providers.md for Azure OpenAI configuration"
-```
+For issues or questions:
+- GitHub Issues: https://github.com/snap-stanford/biomni/issues
+- Documentation: Check `references/` files for detailed guidance
+- Community: Stanford SNAP lab and biomni contributors
--- a/scientific-packages/biomni/references/api_reference.md
+++ b/scientific-packages/biomni/references/api_reference.md
@@ -1,635 +1,460 @@
 # Biomni API Reference

-This document provides comprehensive API documentation for the Biomni biomedical AI agent system.
+Comprehensive API documentation for the biomni framework.

-## Core Classes
+## A1 Agent Class

-### A1 Agent
+The A1 class is the primary interface for interacting with biomni.

-The primary agent class for executing biomedical research tasks.
-
-#### Initialization
+### Initialization

 ```python
 from biomni.agent import A1

 agent = A1(
-    path='./data',              # Path to biomedical knowledge base
-    llm='claude-sonnet-4-20250514',  # LLM model identifier
-    timeout=None,               # Optional timeout in seconds
-    verbose=True               # Enable detailed logging
+    path: str,              # Path to data lake directory
+    llm: str,               # LLM model identifier
+    verbose: bool = True,   # Enable verbose logging
+    mcp_config: str = None  # Path to MCP server configuration
 )
 ```

 **Parameters:**

- `path` (str, required): Directory path where the biomedical knowledge base is stored or will be downloaded. First-time initialization will download ~11GB of data.
- `llm` (str, optional): LLM model identifier. Defaults to the value in `default_config.llm`. Supports multiple providers (see LLM Providers section).
- `timeout` (int, optional): Maximum execution time in seconds for agent operations. Overrides `default_config.timeout_seconds`.
- `verbose` (bool, optional): Enable verbose logging for debugging. Default: True.
+- **`path`** (str, required) - Directory path for biomni data lake (~11GB). Data is automatically downloaded on first use if not present.

-**Returns:** A1 agent instance ready for task execution.
+- **`llm`** (str, required) - LLM model identifier. Options include:
+  - `'claude-sonnet-4-20250514'` - Recommended for balanced performance
+  - `'claude-opus-4-20250514'` - Maximum capability
+  - `'gpt-4'`, `'gpt-4-turbo'` - OpenAI models
+  - `'gemini-2.0-flash-exp'` - Google Gemini
+  - `'llama-3.3-70b-versatile'` - Via Groq
+  - Custom model endpoints via provider configuration

-#### Methods
+- **`verbose`** (bool, optional, default=True) - Enable detailed logging of agent reasoning, tool use, and code execution.

-##### `go(task_description: str) -> None`
+- **`mcp_config`** (str, optional) - Path to MCP (Model Context Protocol) server configuration file for external tool integration.
+
+**Example:**
+```python
+# Basic initialization
+agent = A1(path='./biomni_data', llm='claude-sonnet-4-20250514')
+
+# With MCP integration
+agent = A1(
+    path='./biomni_data',
+    llm='claude-sonnet-4-20250514',
+    mcp_config='./.biomni/mcp_config.json'
+)
+```
+
+### Core Methods
+
+#### `go(query: str) -> str`

 Execute a biomedical research task autonomously.

 ```python
-agent.go("Analyze this scRNA-seq dataset and identify cell types")
+result = agent.go(query: str)
 ```

 **Parameters:**
- `task_description` (str, required): Natural language description of the biomedical task to execute. Be specific about:
-  - Data location and format
-  - Desired analysis or output
-  - Any specific methods or parameters
-  - Expected results format
+- **`query`** (str) - Natural language description of the biomedical task to execute
+
+**Returns:**
+- **`str`** - Final answer or analysis result from the agent

 **Behavior:**
-1. Decomposes the task into executable steps
-2. Retrieves relevant biomedical knowledge from the data lake
-3. Generates and executes Python/R code
-4. Provides results and visualizations
-5. Handles errors and retries with refinement
+1. Decomposes query into executable sub-tasks
+2. Retrieves relevant knowledge from integrated databases
+3. Generates and executes Python code for analysis
+4. Iterates on results until task completion
+5. Returns final synthesized answer

-**Notes:**
- Executes code with system privileges - use in sandboxed environments
- Long-running tasks may require timeout adjustments
- Intermediate results are displayed during execution
+**Example:**
+```python
+result = agent.go("""
+Identify genes associated with Alzheimer's disease from GWAS data.
+Perform pathway enrichment analysis on top hits.
+""")
+print(result)
+```

-##### `save_conversation_history(output_path: str, format: str = 'pdf') -> None`
+#### `save_conversation_history(output_path: str, format: str = 'pdf')`

-Export conversation history and execution trace as a formatted report.
+Save complete conversation history including task, reasoning, code, and results.

 ```python
 agent.save_conversation_history(
-    output_path='./reports/analysis_log.pdf',
-    format='pdf'
+    output_path: str,
+    format: str = 'pdf'
 )
 ```

 **Parameters:**
- `output_path` (str, required): File path for the output report
- `format` (str, optional): Output format. Options: 'pdf', 'markdown'. Default: 'pdf'
+- **`output_path`** (str) - File path for saved report
+- **`format`** (str, optional, default='pdf') - Output format: `'pdf'`, `'html'`, or `'markdown'`

-**Requirements:**
- For PDF: Install one of: WeasyPrint, markdown2pdf, or Pandoc
-  ```bash
-  pip install weasyprint  # Recommended
-  # or
-  pip install markdown2pdf
-  # or install Pandoc system-wide
-  ```
+**Example:**
+```python
+agent.save_conversation_history('reports/alzheimers_gwas_analysis.pdf')
+```

-**Report Contents:**
- Task description and parameters
- Retrieved biomedical knowledge
- Generated code with execution traces
- Results, visualizations, and outputs
- Timestamps and execution metadata
+#### `reset()`

-##### `add_mcp(config_path: str) -> None`
-
-Add Model Context Protocol (MCP) tools to extend agent capabilities.
+Reset agent state and clear conversation history.

 ```python
-agent.add_mcp(config_path='./mcp_tools_config.json')
+agent.reset()
 ```

-**Parameters:**
- `config_path` (str, required): Path to MCP configuration JSON file
+Use when starting a new independent task to clear previous context.

-**MCP Configuration Format:**
-```json
-{
-  "tools": [
-    {
-      "name": "tool_name",
-      "endpoint": "http://localhost:8000/tool",
-      "description": "Tool description for LLM",
-      "parameters": {
-        "param1": "string",
-        "param2": "integer"
-      }
-    }
-  ]
-}
+**Example:**
+```python
+# Task 1
+agent.go("Analyze dataset A")
+agent.save_conversation_history("task1.pdf")
+
+# Reset for fresh context
+agent.reset()
+
+# Task 2 - independent of Task 1
+agent.go("Analyze dataset B")
 ```

-**Use Cases:**
- Connect to laboratory information systems
- Integrate proprietary databases
- Access specialized computational resources
- Link to institutional data repositories
+### Configuration via default_config

-## Configuration
-
-### default_config
-
-Global configuration object for Biomni settings.
+Global configuration parameters accessible via `biomni.config.default_config`.

 ```python
 from biomni.config import default_config
-```

-#### Attributes
-
-##### `llm: str`
-
-Default LLM model identifier for all agent instances.
-
-```python
+# LLM Configuration
 default_config.llm = "claude-sonnet-4-20250514"
-```
+default_config.llm_temperature = 0.7

-**Supported Models:**
-
-**Anthropic:**
- `claude-sonnet-4-20250514` (Recommended)
- `claude-opus-4-20250514`
- `claude-3-5-sonnet-20241022`
- `claude-3-opus-20240229`
-
-**OpenAI:**
- `gpt-4o`
- `gpt-4`
- `gpt-4-turbo`
- `gpt-3.5-turbo`
-
-**Azure OpenAI:**
- `azure/gpt-4`
- `azure/<deployment-name>`
-
-**Google Gemini:**
- `gemini/gemini-pro`
- `gemini/gemini-1.5-pro`
-
-**Groq:**
- `groq/llama-3.1-70b-versatile`
- `groq/mixtral-8x7b-32768`
-
-**Ollama (Local):**
- `ollama/llama3`
- `ollama/mistral`
- `ollama/<model-name>`
-
-**AWS Bedrock:**
- `bedrock/anthropic.claude-v2`
- `bedrock/anthropic.claude-3-sonnet`
-
-**Custom/Biomni-R0:**
- `openai/biomni-r0` (requires local SGLang deployment)
-
-##### `timeout_seconds: int`
-
-Default timeout for agent operations in seconds.
-
-```python
+# Execution Parameters
 default_config.timeout_seconds = 1200  # 20 minutes
+default_config.max_iterations = 50     # Max reasoning loops
+default_config.max_tokens = 4096       # Max tokens per LLM call
+
+# Code Execution
+default_config.enable_code_execution = True
+default_config.sandbox_mode = False    # Enable for restricted execution
+
+# Data and Caching
+default_config.data_cache_dir = "./biomni_cache"
+default_config.enable_caching = True
 ```

-**Recommended Values:**
- Simple tasks (QC, basic analysis): 300-600 seconds
- Medium tasks (differential expression, clustering): 600-1200 seconds
- Complex tasks (full pipelines, ML models): 1200-3600 seconds
- Very complex tasks: 3600+ seconds
+**Key Parameters:**

-##### `data_path: str`
+- **`timeout_seconds`** (int, default=1200) - Maximum time for task execution. Increase for complex analyses.

-Default path to biomedical knowledge base.
+- **`max_iterations`** (int, default=50) - Maximum agent reasoning loops. Prevents infinite loops.
+
+- **`enable_code_execution`** (bool, default=True) - Allow agent to execute generated code. Disable for code generation only.
+
+- **`sandbox_mode`** (bool, default=False) - Enable sandboxed code execution (requires additional setup).
+
+## BiomniEval1 Evaluation Framework
+
+Framework for benchmarking agent performance on biomedical tasks.
+
+### Initialization

 ```python
-default_config.data_path = "/path/to/biomni/data"
+from biomni.eval import BiomniEval1
+
+evaluator = BiomniEval1(
+    dataset_path: str = None,  # Path to evaluation dataset
+    metrics: list = None        # Evaluation metrics to compute
+)
 ```

-**Storage Requirements:**
- Initial download: ~11GB
- Extracted size: ~15GB
- Additional working space: ~5-10GB recommended
-
-##### `api_base: str`
-
-Custom API endpoint for LLM providers (advanced usage).
-
+**Example:**
 ```python
-# For local Biomni-R0 deployment
-default_config.api_base = "http://localhost:30000/v1"
-
-# For custom OpenAI-compatible endpoints
-default_config.api_base = "https://your-endpoint.com/v1"
+evaluator = BiomniEval1()
 ```

-##### `max_retries: int`
+### Methods

-Number of retry attempts for failed operations.
+#### `evaluate(task_type: str, instance_id: str, answer: str) -> float`
+
+Evaluate agent answer against ground truth.

 ```python
-default_config.max_retries = 3
-```
-
-#### Methods
-
-##### `reset() -> None`
-
-Reset all configuration values to system defaults.
-
-```python
-default_config.reset()
-```
-
-## Database Query System
-
-Biomni includes a retrieval-augmented generation (RAG) system for querying the biomedical knowledge base.
-
-### Query Functions
-
-#### `query_genes(query: str, top_k: int = 10) -> List[Dict]`
-
-Query gene information from integrated databases.
-
-```python
-from biomni.database import query_genes
-
-results = query_genes(
-    query="genes involved in p53 pathway",
-    top_k=20
+score = evaluator.evaluate(
+    task_type: str,     # Task category
+    instance_id: str,   # Specific task instance
+    answer: str         # Agent-generated answer
 )
 ```

 **Parameters:**
- `query` (str): Natural language or gene identifier query
- `top_k` (int): Number of results to return
+- **`task_type`** (str) - Task category: `'crispr_design'`, `'scrna_analysis'`, `'gwas_interpretation'`, `'drug_admet'`, `'clinical_diagnosis'`
+- **`instance_id`** (str) - Unique identifier for task instance from dataset
+- **`answer`** (str) - Agent's answer to evaluate

-**Returns:** List of dictionaries containing:
- `gene_symbol`: Official gene symbol
- `gene_name`: Full gene name
- `description`: Functional description
- `pathways`: Associated biological pathways
- `go_terms`: Gene Ontology annotations
- `diseases`: Associated diseases
- `similarity_score`: Relevance score (0-1)
+**Returns:**
+- **`float`** - Evaluation score (0.0 to 1.0)

-#### `query_proteins(query: str, top_k: int = 10) -> List[Dict]`
+**Example:**
+```python
+# Generate answer
+result = agent.go("Design CRISPR screen for autophagy genes")

-Query protein information from UniProt and other sources.
+# Evaluate
+score = evaluator.evaluate(
+    task_type='crispr_design',
+    instance_id='autophagy_001',
+    answer=result
+)
+print(f"Score: {score:.2f}")
+```
+
+#### `load_dataset() -> dict`
+
+Load the Biomni-Eval1 benchmark dataset.

 ```python
-from biomni.database import query_proteins
+dataset = evaluator.load_dataset()
+```

-results = query_proteins(
-    query="kinase proteins in cell cycle",
-    top_k=15
+**Returns:**
+- **`dict`** - Dictionary with task instances organized by task type
+
+**Example:**
+```python
+dataset = evaluator.load_dataset()
+
+for task_type, instances in dataset.items():
+    print(f"{task_type}: {len(instances)} instances")
+```
+
+#### `run_benchmark(agent: A1, task_types: list = None) -> dict`
+
+Run full benchmark evaluation on agent.
+
+```python
+results = evaluator.run_benchmark(
+    agent: A1,
+    task_types: list = None  # Specific task types or None for all
 )
 ```

-**Returns:** List of dictionaries with protein metadata:
- `uniprot_id`: UniProt accession
- `protein_name`: Protein name
- `function`: Functional annotation
- `domains`: Protein domains
- `subcellular_location`: Cellular localization
- `similarity_score`: Relevance score
-
-#### `query_drugs(query: str, top_k: int = 10) -> List[Dict]`
-
-Query drug and compound information.
+**Returns:**
+- **`dict`** - Results with scores, timing, and detailed metrics per task

+**Example:**
 ```python
-from biomni.database import query_drugs
-
-results = query_drugs(
-    query="FDA approved cancer drugs targeting EGFR",
-    top_k=10
+results = evaluator.run_benchmark(
+    agent=agent,
+    task_types=['crispr_design', 'scrna_analysis']
 )
+
+print(f"Overall accuracy: {results['mean_score']:.2f}")
+print(f"Average time: {results['mean_time']:.1f}s")
 ```

-**Returns:** Drug information including:
- `drug_name`: Common name
- `drugbank_id`: DrugBank identifier
- `indication`: Therapeutic indication
- `mechanism`: Mechanism of action
- `targets`: Molecular targets
- `approval_status`: Regulatory status
- `smiles`: Chemical structure (SMILES notation)
+## Data Lake API

-#### `query_diseases(query: str, top_k: int = 10) -> List[Dict]`
+Access integrated biomedical databases programmatically.

-Query disease information from clinical databases.
+### Gene Database Queries

 ```python
-from biomni.database import query_diseases
+from biomni.data import GeneDB

-results = query_diseases(
-    query="autoimmune diseases affecting joints",
-    top_k=10
-)
+gene_db = GeneDB(path='./biomni_data')
+
+# Query gene information
+gene_info = gene_db.get_gene('BRCA1')
+# Returns: {'symbol': 'BRCA1', 'name': '...', 'function': '...', ...}
+
+# Search genes by pathway
+pathway_genes = gene_db.search_by_pathway('DNA repair')
+# Returns: List of gene symbols in pathway
+
+# Get gene interactions
+interactions = gene_db.get_interactions('TP53')
+# Returns: List of interacting genes with interaction types
 ```

-**Returns:** Disease data:
- `disease_name`: Standard disease name
- `disease_id`: Ontology identifier
- `symptoms`: Clinical manifestations
- `associated_genes`: Genetic associations
- `prevalence`: Epidemiological data
-
-#### `query_pathways(query: str, top_k: int = 10) -> List[Dict]`
-
-Query biological pathways from KEGG, Reactome, and other sources.
+### Protein Structure Access

 ```python
-from biomni.database import query_pathways
+from biomni.data import ProteinDB

-results = query_pathways(
-    query="immune response signaling pathways",
-    top_k=15
-)
+protein_db = ProteinDB(path='./biomni_data')
+
+# Get AlphaFold structure
+structure = protein_db.get_structure('P38398')  # BRCA1 UniProt ID
+# Returns: Path to PDB file or structure object
+
+# Search PDB database
+pdb_entries = protein_db.search_pdb('kinase', resolution_max=2.5)
+# Returns: List of PDB IDs matching criteria
 ```

-**Returns:** Pathway information:
- `pathway_name`: Pathway name
- `pathway_id`: Database identifier
- `genes`: Genes in pathway
- `description`: Functional description
- `source`: Database source (KEGG, Reactome, etc.)
-
-## Data Structures
-
-### TaskResult
-
-Result object returned by complex agent operations.
+### Clinical Data Access

 ```python
-class TaskResult:
-    success: bool           # Whether task completed successfully
-    output: Any            # Task output (varies by task)
-    code: str             # Generated code
-    execution_time: float # Execution time in seconds
-    error: Optional[str]  # Error message if failed
-    metadata: Dict        # Additional metadata
+from biomni.data import ClinicalDB
+
+clinical_db = ClinicalDB(path='./biomni_data')
+
+# Query ClinVar variants
+variant_info = clinical_db.get_variant('rs429358')  # APOE4 variant
+# Returns: {'significance': '...', 'disease': '...', 'frequency': ...}
+
+# Search OMIM for disease
+disease_info = clinical_db.search_omim('Alzheimer')
+# Returns: List of OMIM entries with gene associations
 ```

-### BiomedicalEntity
-
-Base class for biomedical entities in the knowledge base.
+### Literature Search

 ```python
-class BiomedicalEntity:
-    entity_id: str        # Unique identifier
-    entity_type: str      # Type (gene, protein, drug, etc.)
-    name: str            # Entity name
-    description: str     # Description
-    attributes: Dict     # Additional attributes
-    references: List[str] # Literature references
+from biomni.data import LiteratureDB
+
+lit_db = LiteratureDB(path='./biomni_data')
+
+# Search PubMed abstracts
+papers = lit_db.search('CRISPR screening cancer', max_results=10)
+# Returns: List of paper dictionaries with titles, abstracts, PMIDs
+
+# Get citations for paper
+citations = lit_db.get_citations('PMID:12345678')
+# Returns: List of citing papers
 ```

-## Utility Functions
+## MCP Server Integration

-### `download_data(path: str, force: bool = False) -> None`
+Extend biomni with external tools via Model Context Protocol.

-Manually download or update the biomedical knowledge base.
+### Configuration Format
+
+Create `.biomni/mcp_config.json`:
+
+```json
+{
+  "servers": {
+    "fda-drugs": {
+      "command": "python",
+      "args": ["-m", "mcp_server_fda"],
+      "env": {
+        "FDA_API_KEY": "${FDA_API_KEY}"
+      }
+    },
+    "web-search": {
+      "command": "npx",
+      "args": ["-y", "@modelcontextprotocol/server-brave-search"],
+      "env": {
+        "BRAVE_API_KEY": "${BRAVE_API_KEY}"
+      }
+    }
+  }
+}
+```
+
+### Using MCP Tools in Tasks

 ```python
-from biomni.utils import download_data
-
-download_data(
+# Initialize with MCP config
+agent = A1(
    path='./data',
-    force=True  # Force re-download
+    llm='claude-sonnet-4-20250514',
+    mcp_config='./.biomni/mcp_config.json'
 )
-```

-### `validate_environment() -> Dict[str, bool]`
-
-Check if the environment is properly configured.
-
-```python
-from biomni.utils import validate_environment
-
-status = validate_environment()
-# Returns: {
-#   'conda_env': True,
-#   'api_keys': True,
-#   'data_available': True,
-#   'dependencies': True
-# }
-```
-
-### `list_available_models() -> List[str]`
-
-Get a list of available LLM models based on configured API keys.
-
-```python
-from biomni.utils import list_available_models
-
-models = list_available_models()
-# Returns: ['claude-sonnet-4-20250514', 'gpt-4o', ...]
+# Agent can now use MCP tools automatically
+result = agent.go("""
+Search for FDA-approved drugs targeting EGFR.
+Get their approval dates and indications.
+""")
+# Agent uses fda-drugs MCP server automatically
 ```

 ## Error Handling

-### Common Exceptions
-
-#### `BiomniConfigError`
-
-Raised when configuration is invalid or incomplete.
+Common exceptions and handling strategies:

 ```python
-from biomni.exceptions import BiomniConfigError
+from biomni.exceptions import (
+    BiomniException,
+    LLMError,
+    CodeExecutionError,
+    DataNotFoundError,
+    TimeoutError
+)

 try:
-    agent = A1(path='./data')
-except BiomniConfigError as e:
-    print(f"Configuration error: {e}")
-```
-
-#### `BiomniExecutionError`
-
-Raised when code generation or execution fails.
-
-```python
-from biomni.exceptions import BiomniExecutionError
-
-try:
-    agent.go("invalid task")
-except BiomniExecutionError as e:
-    print(f"Execution failed: {e}")
-    # Access failed code: e.code
-    # Access error details: e.details
-```
-
-#### `BiomniDataError`
-
-Raised when knowledge base or data access fails.
-
-```python
-from biomni.exceptions import BiomniDataError
-
-try:
-    results = query_genes("unknown query format")
-except BiomniDataError as e:
-    print(f"Data access error: {e}")
-```
-
-#### `BiomniTimeoutError`
-
-Raised when operations exceed timeout limit.
-
-```python
-from biomni.exceptions import BiomniTimeoutError
-
-try:
-    agent.go("very complex long-running task")
-except BiomniTimeoutError as e:
-    print(f"Task timed out after {e.duration} seconds")
-    # Partial results may be available: e.partial_results
+    result = agent.go("Complex biomedical task")
+except TimeoutError:
+    # Task exceeded timeout_seconds
+    print("Task timed out. Consider increasing timeout.")
+    default_config.timeout_seconds = 3600
+except CodeExecutionError as e:
+    # Generated code failed to execute
+    print(f"Code execution error: {e}")
+    # Review generated code in conversation history
+except DataNotFoundError:
+    # Required data not in data lake
+    print("Data not found. Ensure data lake is downloaded.")
+except LLMError as e:
+    # LLM API error
+    print(f"LLM error: {e}")
+    # Check API keys and rate limits
 ```

 ## Best Practices

-### Efficient Knowledge Retrieval
+### Efficient API Usage

-Pre-query databases for relevant context before complex tasks:
+1. **Reuse agent instances** for related tasks to maintain context
+2. **Set appropriate timeouts** based on task complexity
+3. **Use caching** to avoid redundant data downloads
+4. **Monitor iterations** to detect reasoning loops early
+
+### Production Deployment

 ```python
-from biomni.database import query_genes, query_pathways
+from biomni.agent import A1
+from biomni.config import default_config
+import logging

-# Gather relevant biological context first
-genes = query_genes("cell cycle genes", top_k=50)
-pathways = query_pathways("cell cycle regulation", top_k=20)
+# Configure logging
+logging.basicConfig(level=logging.INFO)

-# Then execute task with enriched context
-agent.go(f"""
-Analyze the cell cycle progression in this dataset.
-Focus on these genes: {[g['gene_symbol'] for g in genes]}
-Consider these pathways: {[p['pathway_name'] for p in pathways]}
-""")
-```
+# Production settings
+default_config.timeout_seconds = 3600
+default_config.max_iterations = 100
+default_config.sandbox_mode = True  # Enable sandboxing

-### Error Recovery
-
-Implement robust error handling for production workflows:
-
-```python
-from biomni.exceptions import BiomniExecutionError, BiomniTimeoutError
-
-max_attempts = 3
-for attempt in range(max_attempts):
-    try:
-        agent.go("complex biomedical task")
-        break
-    except BiomniTimeoutError:
-        # Increase timeout and retry
-        default_config.timeout_seconds *= 2
-        print(f"Timeout, retrying with {default_config.timeout_seconds}s timeout")
-    except BiomniExecutionError as e:
-        # Refine task based on error
-        print(f"Execution failed: {e}, refining task...")
-        # Optionally modify task description
-    else:
-        print("Task failed after max attempts")
+# Initialize with error handling
+try:
+    agent = A1(path='/data/biomni', llm='claude-sonnet-4-20250514')
+    result = agent.go(task_query)
+    agent.save_conversation_history(f'reports/{task_id}.pdf')
+except Exception as e:
+    logging.error(f"Task {task_id} failed: {e}")
+    # Handle failure appropriately
 ```

 ### Memory Management

-For large-scale analyses, manage memory explicitly:
+For large-scale analyses:

 ```python
-import gc
-
 # Process datasets in chunks
-for chunk_id in range(num_chunks):
-    agent.go(f"Process data chunk {chunk_id} located at data/chunk_{chunk_id}.h5ad")
+chunk_results = []
+for chunk in dataset_chunks:
+    agent.reset()  # Clear memory between chunks
+    result = agent.go(f"Analyze chunk: {chunk}")
+    chunk_results.append(result)

-    # Force garbage collection between chunks
-    gc.collect()
-
-    # Save intermediate results
-    agent.save_conversation_history(f"./reports/chunk_{chunk_id}.pdf")
-```
-
-### Reproducibility
-
-Ensure reproducible analyses by:
-
-1. **Fixing random seeds:**
-```python
-agent.go("Set random seed to 42 for all analyses, then perform clustering...")
-```
-
-2. **Logging configuration:**
-```python
-import json
-config_log = {
-    'llm': default_config.llm,
-    'timeout': default_config.timeout_seconds,
-    'data_path': default_config.data_path,
-    'timestamp': datetime.now().isoformat()
-}
-with open('config_log.json', 'w') as f:
-    json.dump(config_log, f, indent=2)
-```
-
-3. **Saving execution traces:**
-```python
-# Always save detailed reports
-agent.save_conversation_history('./reports/full_analysis.pdf')
-```
-
-## Performance Optimization
-
-### Model Selection Strategy
-
-Choose models based on task characteristics:
-
-```python
-# For exploratory, simple tasks
-default_config.llm = "gpt-3.5-turbo"  # Fast, cost-effective
-
-# For standard biomedical analyses
-default_config.llm = "claude-sonnet-4-20250514"  # Recommended
-
-# For complex reasoning and hypothesis generation
-default_config.llm = "claude-opus-4-20250514"  # Highest quality
-
-# For specialized biological reasoning
-default_config.llm = "openai/biomni-r0"  # Requires local deployment
-```
-
-### Timeout Tuning
-
-Set appropriate timeouts based on task complexity:
-
-```python
-# Quick queries and simple analyses
-agent = A1(path='./data', timeout=300)
-
-# Standard workflows
-agent = A1(path='./data', timeout=1200)
-
-# Full pipelines with ML training
-agent = A1(path='./data', timeout=3600)
-```
-
-### Caching and Reuse
-
-Reuse agent instances for multiple related tasks:
-
-```python
-# Create agent once
-agent = A1(path='./data', llm='claude-sonnet-4-20250514')
-
-# Execute multiple related tasks
-tasks = [
-    "Load and QC the scRNA-seq dataset",
-    "Perform clustering with resolution 0.5",
-    "Identify marker genes for each cluster",
-    "Annotate cell types based on markers"
-]
-
-for task in tasks:
-    agent.go(task)
-
-# Save complete workflow
-agent.save_conversation_history('./reports/full_workflow.pdf')
+# Combine results
+final_result = combine_results(chunk_results)
 ```
--- a/scientific-packages/biomni/references/llm_providers.md
+++ b/scientific-packages/biomni/references/llm_providers.md
--- a/scientific-packages/biomni/references/task_examples.md
+++ b/scientific-packages/biomni/references/task_examples.md
--- a/scientific-packages/biomni/references/use_cases.md
+++ b/scientific-packages/biomni/references/use_cases.md
@@ -0,0 +1,867 @@
+# Biomni Use Cases and Examples
+
+Comprehensive examples demonstrating biomni across biomedical research domains.
+
+## Table of Contents
+
+1. [CRISPR Screening and Gene Editing](#crispr-screening-and-gene-editing)
+2. [Single-Cell RNA-seq Analysis](#single-cell-rna-seq-analysis)
+3. [Drug Discovery and ADMET](#drug-discovery-and-admet)
+4. [GWAS and Genetic Analysis](#gwas-and-genetic-analysis)
+5. [Clinical Genomics and Diagnostics](#clinical-genomics-and-diagnostics)
+6. [Protein Structure and Function](#protein-structure-and-function)
+7. [Literature and Knowledge Synthesis](#literature-and-knowledge-synthesis)
+8. [Multi-Omics Integration](#multi-omics-integration)
+
+---
+
+## CRISPR Screening and Gene Editing
+
+### Example 1: Genome-Wide CRISPR Screen Design
+
+**Task:** Design a CRISPR knockout screen to identify genes regulating autophagy.
+
+```python
+from biomni.agent import A1
+
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Design a genome-wide CRISPR knockout screen to identify genes regulating
+autophagy in HEK293 cells.
+
+Requirements:
+1. Generate comprehensive sgRNA library targeting all protein-coding genes
+2. Design 4 sgRNAs per gene with optimal on-target and minimal off-target scores
+3. Include positive controls (known autophagy regulators: ATG5, BECN1, ULK1)
+4. Include negative controls (non-targeting sgRNAs)
+5. Prioritize genes based on:
+   - Existing autophagy pathway annotations
+   - Protein-protein interactions with known autophagy factors
+   - Expression levels in HEK293 cells
+6. Output sgRNA sequences, scores, and gene prioritization rankings
+
+Provide analysis as Python code and interpret results.
+""")
+
+agent.save_conversation_history("autophagy_screen_design.pdf")
+```
+
+**Expected Output:**
+- sgRNA library with ~80,000 guides (4 per gene × ~20,000 genes)
+- On-target and off-target scores for each sgRNA
+- Prioritized gene list based on pathway enrichment
+- Quality control metrics for library design
+
+### Example 2: CRISPR Off-Target Prediction
+
+```python
+result = agent.go("""
+Analyze potential off-target effects for this sgRNA sequence:
+GCTGAAGATCCAGTTCGATG
+
+Tasks:
+1. Identify all genomic locations with ≤3 mismatches
+2. Score each potential off-target site
+3. Assess likelihood of cleavage at off-target sites
+4. Recommend whether sgRNA is suitable for use
+5. If unsuitable, suggest alternative sgRNAs for the same gene
+""")
+```
+
+### Example 3: Screen Hit Analysis
+
+```python
+result = agent.go("""
+Analyze CRISPR screen results from autophagy phenotype screen.
+
+Input file: screen_results.csv
+Columns: sgRNA_ID, gene, log2_fold_change, p_value, FDR
+
+Tasks:
+1. Identify significant hits (FDR < 0.05, |LFC| > 1.5)
+2. Perform gene ontology enrichment on hit genes
+3. Map hits to known autophagy pathways
+4. Identify novel candidates not previously linked to autophagy
+5. Predict functional relationships between hit genes
+6. Generate visualization of hit genes in pathway context
+""")
+```
+
+---
+
+## Single-Cell RNA-seq Analysis
+
+### Example 1: Cell Type Annotation
+
+**Task:** Analyze single-cell RNA-seq data and annotate cell populations.
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Analyze single-cell RNA-seq dataset from human PBMC sample.
+
+File: pbmc_data.h5ad (10X Genomics format)
+
+Workflow:
+1. Quality control:
+   - Filter cells with <200 or >5000 detected genes
+   - Remove cells with >20% mitochondrial content
+   - Filter genes detected in <3 cells
+
+2. Normalization and preprocessing:
+   - Normalize to 10,000 reads per cell
+   - Log-transform
+   - Identify highly variable genes
+   - Scale data
+
+3. Dimensionality reduction:
+   - PCA (50 components)
+   - UMAP visualization
+
+4. Clustering:
+   - Leiden algorithm with resolution=0.8
+   - Identify cluster markers (Wilcoxon rank-sum test)
+
+5. Cell type annotation:
+   - Annotate clusters using marker genes:
+     * T cells (CD3D, CD3E)
+     * B cells (CD79A, MS4A1)
+     * NK cells (GNLY, NKG7)
+     * Monocytes (CD14, LYZ)
+     * Dendritic cells (FCER1A, CST3)
+
+6. Generate UMAP plots with annotations and export results
+""")
+
+agent.save_conversation_history("pbmc_scrna_analysis.pdf")
+```
+
+### Example 2: Differential Expression Analysis
+
+```python
+result = agent.go("""
+Perform differential expression analysis between conditions in scRNA-seq data.
+
+Data: pbmc_treated_vs_control.h5ad
+Conditions: treated (drug X) vs control
+
+Tasks:
+1. Identify differentially expressed genes for each cell type
+2. Use statistical tests appropriate for scRNA-seq (MAST or Wilcoxon)
+3. Apply multiple testing correction (Benjamini-Hochberg)
+4. Threshold: |log2FC| > 0.5, adjusted p < 0.05
+5. Perform pathway enrichment on DE genes per cell type
+6. Identify cell-type-specific drug responses
+7. Generate volcano plots and heatmaps
+""")
+```
+
+### Example 3: Trajectory Analysis
+
+```python
+result = agent.go("""
+Perform pseudotime trajectory analysis on differentiation dataset.
+
+Data: hematopoiesis_scrna.h5ad
+Starting population: Hematopoietic stem cells (HSCs)
+
+Analysis:
+1. Subset to hematopoietic lineages
+2. Compute diffusion map or PAGA for trajectory inference
+3. Order cells along pseudotime
+4. Identify genes with dynamic expression along trajectory
+5. Cluster genes by expression patterns
+6. Map trajectories to known differentiation pathways
+7. Visualize key transcription factors driving differentiation
+""")
+```
+
+---
+
+## Drug Discovery and ADMET
+
+### Example 1: ADMET Property Prediction
+
+**Task:** Predict ADMET properties for drug candidates.
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Predict ADMET properties for these drug candidates:
+
+Compounds (SMILES format):
+1. CC1=C(C=C(C=C1)NC(=O)C2=CC=C(C=C2)CN3CCN(CC3)C)NC4=NC=CC(=N4)C5=CN=CC=C5
+2. CN1CCN(CC1)C2=C(C=C3C(=C2)N=CN=C3NC4=CC=C(C=C4)F)OC
+3. CC(C)(C)NC(=O)N(CC1=CC=CC=C1)C2CCN(CC2)C(=O)C3=CC4=C(C=C3)OCO4
+
+For each compound, predict:
+
+**Absorption:**
+- Caco-2 permeability (cm/s)
+- Human intestinal absorption (HIA %)
+- Oral bioavailability
+
+**Distribution:**
+- Plasma protein binding (%)
+- Blood-brain barrier penetration (BBB+/-)
+- Volume of distribution (L/kg)
+
+**Metabolism:**
+- CYP450 substrate/inhibitor predictions (2D6, 3A4, 2C9, 2C19)
+- Metabolic stability (T1/2)
+
+**Excretion:**
+- Clearance (mL/min/kg)
+- Half-life (hours)
+
+**Toxicity:**
+- hERG IC50 (cardiotoxicity risk)
+- Hepatotoxicity prediction
+- Ames mutagenicity
+- LD50 estimates
+
+Provide predictions with confidence scores and flag any red flags.
+""")
+
+agent.save_conversation_history("admet_predictions.pdf")
+```
+
+### Example 2: Target Identification
+
+```python
+result = agent.go("""
+Identify potential protein targets for Alzheimer's disease drug development.
+
+Tasks:
+1. Query GWAS data for Alzheimer's-associated genes
+2. Identify genes with druggable domains (kinases, GPCRs, ion channels, etc.)
+3. Check for brain expression patterns
+4. Assess disease relevance via literature mining
+5. Evaluate existing chemical probe availability
+6. Rank targets by:
+   - Genetic evidence strength
+   - Druggability
+   - Lack of existing therapies
+7. Suggest target validation experiments
+""")
+```
+
+### Example 3: Virtual Screening
+
+```python
+result = agent.go("""
+Perform virtual screening for EGFR kinase inhibitors.
+
+Database: ZINC15 lead-like subset (~6M compounds)
+Target: EGFR kinase domain (PDB: 1M17)
+
+Workflow:
+1. Prepare protein structure (remove waters, add hydrogens)
+2. Define binding pocket (based on erlotinib binding site)
+3. Generate pharmacophore model from known EGFR inhibitors
+4. Filter ZINC database by:
+   - Molecular weight: 200-500 Da
+   - LogP: 0-5
+   - Lipinski's rule of five
+   - Pharmacophore match
+5. Dock top 10,000 compounds
+6. Score by docking energy and predicted binding affinity
+7. Select top 100 for further analysis
+8. Predict ADMET properties for top hits
+9. Recommend top 10 compounds for experimental validation
+""")
+```
+
+---
+
+## GWAS and Genetic Analysis
+
+### Example 1: GWAS Summary Statistics Analysis
+
+**Task:** Interpret GWAS results and identify causal genes.
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Analyze GWAS summary statistics for Type 2 Diabetes.
+
+Input file: t2d_gwas_summary.txt
+Columns: CHR, BP, SNP, P, OR, BETA, SE, A1, A2
+
+Analysis steps:
+1. Identify genome-wide significant variants (P < 5e-8)
+2. Perform LD clumping to identify independent signals
+3. Map variants to genes using:
+   - Nearest gene
+   - eQTL databases (GTEx)
+   - Hi-C chromatin interactions
+4. Prioritize causal genes using multiple evidence:
+   - Fine-mapping scores
+   - Coding variant consequences
+   - Gene expression in relevant tissues (pancreas, liver, adipose)
+   - Pathway enrichment
+5. Identify druggable targets among causal genes
+6. Compare with known T2D genes and highlight novel associations
+7. Generate Manhattan plot, QQ plot, and gene prioritization table
+""")
+
+agent.save_conversation_history("t2d_gwas_analysis.pdf")
+```
+
+### Example 2: Polygenic Risk Score
+
+```python
+result = agent.go("""
+Develop and validate polygenic risk score (PRS) for coronary artery disease (CAD).
+
+Training GWAS: CAD_discovery_summary_stats.txt (N=180,000)
+Validation cohort: CAD_validation_genotypes.vcf (N=50,000)
+
+Tasks:
+1. Select variants for PRS using p-value thresholding (P < 1e-5)
+2. Perform LD clumping (r² < 0.1, 500kb window)
+3. Calculate PRS weights from GWAS betas
+4. Compute PRS for validation cohort individuals
+5. Evaluate PRS performance:
+   - AUC for CAD case/control discrimination
+   - Odds ratios across PRS deciles
+   - Compare to traditional risk factors (age, sex, BMI, smoking)
+6. Assess PRS calibration and create risk stratification plot
+7. Identify high-risk individuals (top 5% PRS)
+""")
+```
+
+### Example 3: Variant Pathogenicity Prediction
+
+```python
+result = agent.go("""
+Predict pathogenicity of rare coding variants in candidate disease genes.
+
+Variants (VCF format):
+- chr17:41234451:A>G (BRCA1 p.Arg1347Gly)
+- chr2:179428448:C>T (TTN p.Trp13579*)
+- chr7:117188679:G>A (CFTR p.Gly542Ser)
+
+For each variant, assess:
+1. In silico predictions (SIFT, PolyPhen2, CADD, REVEL)
+2. Population frequency (gnomAD)
+3. Evolutionary conservation (PhyloP, PhastCons)
+4. Protein structure impact (using AlphaFold structures)
+5. Functional domain location
+6. ClinVar annotations (if available)
+7. Literature evidence
+8. ACMG/AMP classification criteria
+
+Provide pathogenicity classification (benign, likely benign, VUS, likely pathogenic, pathogenic) with supporting evidence.
+""")
+```
+
+---
+
+## Clinical Genomics and Diagnostics
+
+### Example 1: Rare Disease Diagnosis
+
+**Task:** Diagnose rare genetic disease from whole exome sequencing.
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Analyze whole exome sequencing (WES) data for rare disease diagnosis.
+
+Patient phenotypes (HPO terms):
+- HP:0001250 (Seizures)
+- HP:0001249 (Intellectual disability)
+- HP:0001263 (Global developmental delay)
+- HP:0001252 (Hypotonia)
+
+VCF file: patient_trio.vcf (proband + parents)
+
+Analysis workflow:
+1. Variant filtering:
+   - Quality filters (QUAL > 30, DP > 10, GQ > 20)
+   - Frequency filters (gnomAD AF < 0.01)
+   - Functional impact (missense, nonsense, frameshift, splice site)
+
+2. Inheritance pattern analysis:
+   - De novo variants
+   - Autosomal recessive (compound het, homozygous)
+   - X-linked
+
+3. Phenotype-driven prioritization:
+   - Match candidate genes to HPO terms
+   - Use HPO-gene associations
+   - Check gene expression in relevant tissues (brain)
+
+4. Variant pathogenicity assessment:
+   - In silico predictions
+   - ACMG classification
+   - Literature evidence
+
+5. Generate diagnostic report with:
+   - Top candidate variants
+   - Supporting evidence
+   - Functional validation suggestions
+   - Genetic counseling recommendations
+""")
+
+agent.save_conversation_history("rare_disease_diagnosis.pdf")
+```
+
+### Example 2: Cancer Genomics Analysis
+
+```python
+result = agent.go("""
+Analyze tumor-normal paired sequencing for cancer genomics.
+
+Files:
+- tumor_sample.vcf (somatic variants)
+- tumor_rnaseq.bam (gene expression)
+- tumor_cnv.seg (copy number variants)
+
+Analysis:
+1. Identify driver mutations:
+   - Known cancer genes (COSMIC, OncoKB)
+   - Recurrent hotspot mutations
+   - Truncating mutations in tumor suppressors
+
+2. Analyze mutational signatures:
+   - Decompose signatures (COSMIC signatures)
+   - Identify mutagenic processes
+
+3. Copy number analysis:
+   - Identify amplifications and deletions
+   - Focal vs. arm-level events
+   - Assess oncogene amplifications and TSG deletions
+
+4. Gene expression analysis:
+   - Identify outlier gene expression
+   - Fusion transcript detection
+   - Pathway dysregulation
+
+5. Therapeutic implications:
+   - Match alterations to FDA-approved therapies
+   - Identify clinical trial opportunities
+   - Predict response to targeted therapies
+
+6. Generate precision oncology report
+""")
+```
+
+### Example 3: Pharmacogenomics
+
+```python
+result = agent.go("""
+Generate pharmacogenomics report for patient genotype data.
+
+VCF file: patient_pgx.vcf
+
+Analyze variants affecting drug metabolism:
+
+**CYP450 genes:**
+- CYP2D6 (affects ~25% of drugs)
+- CYP2C19 (clopidogrel, PPIs, antidepressants)
+- CYP2C9 (warfarin, NSAIDs)
+- CYP3A5 (tacrolimus, immunosuppressants)
+
+**Drug transporter genes:**
+- SLCO1B1 (statin myopathy risk)
+- ABCB1 (P-glycoprotein)
+
+**Drug targets:**
+- VKORC1 (warfarin dosing)
+- DPYD (fluoropyrimidine toxicity)
+- TPMT (thiopurine toxicity)
+
+For each gene:
+1. Determine diplotype (*1/*1, *1/*2, etc.)
+2. Assign metabolizer phenotype (PM, IM, NM, RM, UM)
+3. Provide dosing recommendations using CPIC/PharmGKB guidelines
+4. Flag high-risk drug-gene interactions
+5. Suggest alternative medications if needed
+
+Generate patient-friendly report with actionable recommendations.
+""")
+```
+
+---
+
+## Protein Structure and Function
+
+### Example 1: AlphaFold Structure Analysis
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Analyze AlphaFold structure prediction for novel protein.
+
+Protein: Hypothetical protein ABC123 (UniProt: Q9XYZ1)
+
+Tasks:
+1. Retrieve AlphaFold structure from database
+2. Assess prediction quality:
+   - pLDDT scores per residue
+   - Identify high-confidence regions (pLDDT > 90)
+   - Flag low-confidence regions (pLDDT < 50)
+
+3. Structural analysis:
+   - Identify domains using structural alignment
+   - Predict fold family
+   - Identify secondary structure elements
+
+4. Functional prediction:
+   - Search for structural homologs in PDB
+   - Identify conserved functional sites
+   - Predict binding pockets
+   - Suggest possible ligands/substrates
+
+5. Variant impact analysis:
+   - Map disease-associated variants to structure
+   - Predict structural consequences
+   - Identify variants affecting binding sites
+
+6. Generate PyMOL visualization scripts highlighting key features
+""")
+
+agent.save_conversation_history("alphafold_analysis.pdf")
+```
+
+### Example 2: Protein-Protein Interaction Prediction
+
+```python
+result = agent.go("""
+Predict and analyze protein-protein interactions for autophagy pathway.
+
+Query proteins: ATG5, ATG12, ATG16L1
+
+Analysis:
+1. Retrieve known interactions from:
+   - STRING database
+   - BioGRID
+   - IntAct
+   - Literature mining
+
+2. Predict novel interactions using:
+   - Structural modeling (AlphaFold-Multimer)
+   - Coexpression analysis
+   - Phylogenetic profiling
+
+3. Analyze interaction interfaces:
+   - Identify binding residues
+   - Assess interface properties (area, hydrophobicity)
+   - Predict binding affinity
+
+4. Functional analysis:
+   - Map interactions to autophagy pathway steps
+   - Identify regulatory interactions
+   - Predict complex stoichiometry
+
+5. Therapeutic implications:
+   - Identify druggable interfaces
+   - Suggest peptide inhibitors
+   - Design disruption strategies
+
+Generate network visualization and interaction details.
+""")
+```
+
+---
+
+## Literature and Knowledge Synthesis
+
+### Example 1: Systematic Literature Review
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Perform systematic literature review on CRISPR base editing applications.
+
+Search query: "CRISPR base editing" OR "base editor" OR "CBE" OR "ABE"
+Date range: 2016-2025
+
+Tasks:
+1. Search PubMed and retrieve relevant abstracts
+2. Filter for original research articles
+3. Extract key information:
+   - Base editor type (CBE, ABE, dual)
+   - Target organism/cell type
+   - Application (disease model, therapy, crop improvement)
+   - Editing efficiency
+   - Off-target assessment
+
+4. Categorize applications:
+   - Therapeutic applications (by disease)
+   - Agricultural applications
+   - Basic research
+
+5. Analyze trends:
+   - Publications over time
+   - Most studied diseases
+   - Evolution of base editor technology
+
+6. Synthesize findings:
+   - Clinical trial status
+   - Remaining challenges
+   - Future directions
+
+Generate comprehensive review document with citation statistics.
+""")
+
+agent.save_conversation_history("crispr_base_editing_review.pdf")
+```
+
+### Example 2: Gene Function Synthesis
+
+```python
+result = agent.go("""
+Synthesize knowledge about gene function from multiple sources.
+
+Target gene: PARK7 (DJ-1)
+
+Integrate information from:
+1. **Genetic databases:**
+   - NCBI Gene
+   - UniProt
+   - OMIM
+
+2. **Expression data:**
+   - GTEx tissue expression
+   - Human Protein Atlas
+   - Single-cell expression atlases
+
+3. **Functional data:**
+   - GO annotations
+   - KEGG pathways
+   - Reactome
+   - Protein interactions (STRING)
+
+4. **Disease associations:**
+   - ClinVar variants
+   - GWAS catalog
+   - Disease databases (DisGeNET)
+
+5. **Literature:**
+   - PubMed abstracts
+   - Key mechanistic studies
+   - Review articles
+
+Synthesize into comprehensive gene report:
+- Molecular function
+- Biological processes
+- Cellular localization
+- Tissue distribution
+- Disease associations
+- Known drug targets/inhibitors
+- Unresolved questions
+
+Generate structured summary suitable for research planning.
+""")
+```
+
+---
+
+## Multi-Omics Integration
+
+### Example 1: Multi-Omics Disease Analysis
+
+```python
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
+
+result = agent.go("""
+Integrate multi-omics data to understand disease mechanism.
+
+Disease: Alzheimer's disease
+Data types:
+- Genomics: GWAS summary statistics (gwas_ad.txt)
+- Transcriptomics: Brain RNA-seq (controls vs AD, rnaseq_data.csv)
+- Proteomics: CSF proteomics (proteomics_csf.csv)
+- Metabolomics: Plasma metabolomics (metabolomics_plasma.csv)
+- Epigenomics: Brain methylation array (methylation_data.csv)
+
+Integration workflow:
+1. Analyze each omics layer independently:
+   - Identify significantly altered features
+   - Perform pathway enrichment
+
+2. Cross-omics correlation:
+   - Correlate gene expression with protein levels
+   - Link genetic variants to expression (eQTL)
+   - Associate methylation with gene expression
+   - Connect proteins to metabolites
+
+3. Network analysis:
+   - Build multi-omics network
+   - Identify key hub genes/proteins
+   - Detect disease modules
+
+4. Causal inference:
+   - Prioritize drivers vs. consequences
+   - Identify therapeutic targets
+   - Predict drug mechanisms
+
+5. Generate integrative model of AD pathogenesis
+
+Provide visualization and therapeutic target recommendations.
+""")
+
+agent.save_conversation_history("ad_multiomics_analysis.pdf")
+```
+
+### Example 2: Systems Biology Modeling
+
+```python
+result = agent.go("""
+Build systems biology model of metabolic pathway.
+
+Pathway: Glycolysis
+Data sources:
+- Enzyme kinetics (BRENDA database)
+- Metabolite concentrations (literature)
+- Gene expression (tissue-specific, GTEx)
+- Flux measurements (C13 labeling studies)
+
+Modeling tasks:
+1. Construct pathway model:
+   - Define reactions and stoichiometry
+   - Parameterize enzyme kinetics (Km, Vmax, Ki)
+   - Set initial metabolite concentrations
+
+2. Simulate pathway dynamics:
+   - Steady-state analysis
+   - Time-course simulations
+   - Sensitivity analysis
+
+3. Constraint-based modeling:
+   - Flux balance analysis (FBA)
+   - Identify bottleneck reactions
+   - Predict metabolic engineering strategies
+
+4. Integrate with gene expression:
+   - Tissue-specific model predictions
+   - Disease vs. normal comparisons
+
+5. Therapeutic predictions:
+   - Enzyme inhibition effects
+   - Metabolic rescue strategies
+   - Drug target identification
+
+Generate model in SBML format and simulation results.
+""")
+```
+
+---
+
+## Best Practices for Task Formulation
+
+### 1. Be Specific and Detailed
+
+**Poor:**
+```python
+agent.go("Analyze this RNA-seq data")
+```
+
+**Good:**
+```python
+agent.go("""
+Analyze bulk RNA-seq data from cancer vs. normal samples.
+
+Files: cancer_rnaseq.csv (TPM values, 50 cancer, 50 normal)
+
+Tasks:
+1. Differential expression (DESeq2, padj < 0.05, |log2FC| > 1)
+2. Pathway enrichment (KEGG, Reactome)
+3. Generate volcano plot and top DE gene heatmap
+""")
+```
+
+### 2. Include File Paths and Formats
+
+Always specify:
+- Exact file paths
+- File formats (VCF, BAM, CSV, H5AD, etc.)
+- Data structure (columns, sample IDs)
+
+### 3. Set Clear Success Criteria
+
+Define thresholds and cutoffs:
+- Statistical significance (P < 0.05, FDR < 0.1)
+- Fold change thresholds
+- Quality filters
+- Expected outputs
+
+### 4. Request Visualizations
+
+Explicitly ask for plots:
+- Volcano plots, MA plots
+- Heatmaps, PCA plots
+- Network diagrams
+- Manhattan plots
+
+### 5. Specify Biological Context
+
+Include:
+- Organism (human, mouse, etc.)
+- Tissue/cell type
+- Disease/condition
+- Treatment details
+
+### 6. Request Interpretations
+
+Ask agent to:
+- Interpret biological significance
+- Suggest follow-up experiments
+- Identify limitations
+- Provide literature context
+
+---
+
+## Common Patterns
+
+### Data Quality Control
+
+```python
+"""
+Before analysis, perform quality control:
+1. Check for missing values
+2. Assess data distributions
+3. Identify outliers
+4. Generate QC report
+Only proceed with analysis if data passes QC.
+"""
+```
+
+### Iterative Refinement
+
+```python
+"""
+Perform analysis in stages:
+1. Initial exploratory analysis
+2. Based on results, refine parameters
+3. Focus on interesting findings
+4. Generate final report
+
+Show intermediate results for each stage.
+"""
+```
+
+### Reproducibility
+
+```python
+"""
+Ensure reproducibility:
+1. Set random seeds where applicable
+2. Log all parameters used
+3. Save intermediate files
+4. Export environment info (package versions)
+5. Generate methods section for paper
+"""
+```
+
+These examples demonstrate the breadth of biomedical tasks biomni can handle. Adapt the patterns to your specific research questions, and always include sufficient detail for the agent to execute autonomously.
--- a/scientific-packages/biomni/scripts/generate_report.py
+++ b/scientific-packages/biomni/scripts/generate_report.py
@@ -1,381 +1,370 @@
 #!/usr/bin/env python3
 """
-Enhanced PDF Report Generation for Biomni
+Enhanced PDF report generation for biomni conversation histories.

-This script provides advanced PDF report generation with custom formatting,
-styling, and metadata for Biomni analysis results.
+This script provides additional customization options for biomni reports:
+- Custom styling and branding
+- Formatted code blocks
+- Section organization
+- Metadata inclusion
+- Export format options (PDF, HTML, Markdown)
+
+Usage:
+    python generate_report.py --input conversation.json --output report.pdf
+    python generate_report.py --agent-object agent --output report.pdf --format html
 """

 import argparse
-import sys
+import json
 from pathlib import Path
+from typing import Dict, List, Optional, Any
 from datetime import datetime
-from typing import Optional, Dict, Any


-def generate_markdown_report(
-    title: str,
-    sections: list,
-    metadata: Optional[Dict[str, Any]] = None,
-    output_path: str = "report.md"
+def format_conversation_history(
+    messages: List[Dict[str, Any]],
+    include_metadata: bool = True,
+    include_code: bool = True,
+    include_timestamps: bool = False
 ) -> str:
    """
-    Generate a formatted markdown report.
+    Format conversation history into structured markdown.

    Args:
-        title: Report title
-        sections: List of dicts with 'heading' and 'content' keys
-        metadata: Optional metadata dict (author, date, etc.)
-        output_path: Path to save markdown file
+        messages: List of conversation message dictionaries
+        include_metadata: Include metadata section
+        include_code: Include code blocks
+        include_timestamps: Include message timestamps

    Returns:
-        Path to generated markdown file
+        Formatted markdown string
    """
-    md_content = []
-
-    # Title
-    md_content.append(f"# {title}\n")
-
-    # Metadata
-    if metadata:
-        md_content.append("---\n")
-        for key, value in metadata.items():
-            md_content.append(f"**{key}:** {value}  \n")
-        md_content.append("---\n\n")
-
-    # Sections
-    for section in sections:
-        heading = section.get('heading', 'Section')
-        content = section.get('content', '')
-        level = section.get('level', 2)  # Default to h2
-
-        md_content.append(f"{'#' * level} {heading}\n\n")
-        md_content.append(f"{content}\n\n")
-
-    # Write to file
-    output = Path(output_path)
-    output.write_text('\n'.join(md_content))
-
-    return str(output)
-
-
-def convert_to_pdf_weasyprint(
-    markdown_path: str,
-    output_path: str,
-    css_style: Optional[str] = None
-) -> bool:
-    """
-    Convert markdown to PDF using WeasyPrint.
-
-    Args:
-        markdown_path: Path to markdown file
-        output_path: Path for output PDF
-        css_style: Optional CSS stylesheet path
-
-    Returns:
-        True if successful, False otherwise
-    """
-    try:
-        import markdown
-        from weasyprint import HTML, CSS
-
-        # Read markdown
-        with open(markdown_path, 'r') as f:
-            md_content = f.read()
-
-        # Convert to HTML
-        html_content = markdown.markdown(
-            md_content,
-            extensions=['tables', 'fenced_code', 'codehilite']
-        )
-
-        # Wrap in HTML template
-        html_template = f"""
-        <!DOCTYPE html>
-        <html>
-        <head>
-            <meta charset="utf-8">
-            <title>Biomni Report</title>
-            <style>
-                body {{
-                    font-family: 'Helvetica', 'Arial', sans-serif;
-                    line-height: 1.6;
-                    color: #333;
-                    max-width: 800px;
-                    margin: 40px auto;
-                    padding: 20px;
-                }}
-                h1 {{
-                    color: #2c3e50;
-                    border-bottom: 3px solid #3498db;
-                    padding-bottom: 10px;
-                }}
-                h2 {{
-                    color: #34495e;
-                    margin-top: 30px;
-                    border-bottom: 1px solid #bdc3c7;
-                    padding-bottom: 5px;
-                }}
-                h3 {{
-                    color: #7f8c8d;
-                }}
-                code {{
-                    background-color: #f4f4f4;
-                    padding: 2px 6px;
-                    border-radius: 3px;
-                    font-family: 'Courier New', monospace;
-                }}
-                pre {{
-                    background-color: #f4f4f4;
-                    padding: 15px;
-                    border-radius: 5px;
-                    overflow-x: auto;
-                }}
-                table {{
-                    border-collapse: collapse;
-                    width: 100%;
-                    margin: 20px 0;
-                }}
-                th, td {{
-                    border: 1px solid #ddd;
-                    padding: 12px;
-                    text-align: left;
-                }}
-                th {{
-                    background-color: #3498db;
-                    color: white;
-                }}
-                tr:nth-child(even) {{
-                    background-color: #f9f9f9;
-                }}
-                .metadata {{
-                    background-color: #ecf0f1;
-                    padding: 15px;
-                    border-radius: 5px;
-                    margin: 20px 0;
-                }}
-            </style>
-        </head>
-        <body>
-            {html_content}
-        </body>
-        </html>
-        """
-
-        # Generate PDF
-        pdf = HTML(string=html_template)
-
-        # Add custom CSS if provided
-        stylesheets = []
-        if css_style and Path(css_style).exists():
-            stylesheets.append(CSS(filename=css_style))
-
-        pdf.write_pdf(output_path, stylesheets=stylesheets)
-
-        return True
-
-    except ImportError:
-        print("Error: WeasyPrint not installed. Install with: pip install weasyprint")
-        return False
-    except Exception as e:
-        print(f"Error generating PDF: {e}")
-        return False
-
-
-def convert_to_pdf_pandoc(markdown_path: str, output_path: str) -> bool:
-    """
-    Convert markdown to PDF using Pandoc.
-
-    Args:
-        markdown_path: Path to markdown file
-        output_path: Path for output PDF
-
-    Returns:
-        True if successful, False otherwise
-    """
-    try:
-        import subprocess
-
-        # Check if pandoc is installed
-        result = subprocess.run(
-            ['pandoc', '--version'],
-            capture_output=True,
-            text=True
-        )
-
-        if result.returncode != 0:
-            print("Error: Pandoc not installed")
-            return False
-
-        # Convert with pandoc
-        result = subprocess.run(
-            [
-                'pandoc',
-                markdown_path,
-                '-o', output_path,
-                '--pdf-engine=pdflatex',
-                '-V', 'geometry:margin=1in',
-                '--toc'
-            ],
-            capture_output=True,
-            text=True
-        )
-
-        if result.returncode != 0:
-            print(f"Pandoc error: {result.stderr}")
-            return False
-
-        return True
-
-    except FileNotFoundError:
-        print("Error: Pandoc not found. Install from https://pandoc.org/")
-        return False
-    except Exception as e:
-        print(f"Error: {e}")
-        return False
-
-
-def create_biomni_report(
-    conversation_history: list,
-    output_path: str = "biomni_report.pdf",
-    method: str = "weasyprint"
-) -> bool:
-    """
-    Create a formatted PDF report from Biomni conversation history.
-
-    Args:
-        conversation_history: List of conversation turns
-        output_path: Output PDF path
-        method: Conversion method ('weasyprint' or 'pandoc')
-
-    Returns:
-        True if successful
-    """
-    # Prepare report sections
-    metadata = {
-        'Date': datetime.now().strftime('%Y-%m-%d %H:%M:%S'),
-        'Tool': 'Biomni AI Agent',
-        'Report Type': 'Analysis Summary'
-    }
-
    sections = []

-    # Executive Summary
-    sections.append({
-        'heading': 'Executive Summary',
-        'level': 2,
-        'content': 'This report contains the complete analysis workflow executed by the Biomni biomedical AI agent.'
-    })
+    # Header
+    sections.append("# Biomni Analysis Report\n")

-    # Conversation history
-    for i, turn in enumerate(conversation_history, 1):
-        sections.append({
-            'heading': f'Task {i}: {turn.get("task", "Analysis")}',
-            'level': 2,
-            'content': f'**Input:**\n```\n{turn.get("input", "")}\n```\n\n**Output:**\n{turn.get("output", "")}'
-        })
+    # Metadata
+    if include_metadata:
+        sections.append("## Metadata\n")
+        sections.append(f"- **Generated**: {datetime.now().strftime('%Y-%m-%d %H:%M:%S')}")
+        sections.append(f"- **Number of interactions**: {len(messages)}")
+        sections.append("\n---\n")
+
+    # Process messages
+    sections.append("## Analysis\n")
+
+    for i, msg in enumerate(messages, 1):
+        role = msg.get('role', 'unknown')
+        content = msg.get('content', '')
+
+        if role == 'user':
+            sections.append(f"### Task {i // 2 + 1}\n")
+            sections.append(f"**Query:**\n```\n{content}\n```\n")
+
+        elif role == 'assistant':
+            sections.append(f"**Response:**\n")
+
+            # Check if content contains code
+            if include_code and ('```' in content or 'import ' in content):
+                # Attempt to separate text and code
+                parts = content.split('```')
+                for j, part in enumerate(parts):
+                    if j % 2 == 0:
+                        # Text content
+                        if part.strip():
+                            sections.append(f"{part.strip()}\n")
+                    else:
+                        # Code content
+                        # Check if language is specified
+                        lines = part.split('\n', 1)
+                        if len(lines) > 1 and lines[0].strip() in ['python', 'r', 'bash', 'sql']:
+                            lang = lines[0].strip()
+                            code = lines[1]
+                        else:
+                            lang = 'python'  # Default to python
+                            code = part
+
+                        sections.append(f"```{lang}\n{code}\n```\n")
+            else:
+                sections.append(f"{content}\n")
+
+            sections.append("\n---\n")
+
+    return '\n'.join(sections)
+
+
+def markdown_to_html(markdown_content: str, title: str = "Biomni Report") -> str:
+    """
+    Convert markdown to styled HTML.
+
+    Args:
+        markdown_content: Markdown string
+        title: HTML page title
+
+    Returns:
+        HTML string
+    """
+    # Simple markdown to HTML conversion
+    # For production use, consider using a library like markdown or mistune
+
+    html_template = f"""
+<!DOCTYPE html>
+<html lang="en">
+<head>
+    <meta charset="UTF-8">
+    <meta name="viewport" content="width=device-width, initial-scale=1.0">
+    <title>{title}</title>
+    <style>
+        body {{
+            font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, sans-serif;
+            line-height: 1.6;
+            max-width: 900px;
+            margin: 0 auto;
+            padding: 20px;
+            color: #333;
+        }}
+        h1 {{
+            color: #2c3e50;
+            border-bottom: 3px solid #3498db;
+            padding-bottom: 10px;
+        }}
+        h2 {{
+            color: #34495e;
+            margin-top: 30px;
+            border-bottom: 2px solid #95a5a6;
+            padding-bottom: 5px;
+        }}
+        h3 {{
+            color: #555;
+        }}
+        code {{
+            background-color: #f4f4f4;
+            padding: 2px 6px;
+            border-radius: 3px;
+            font-family: 'Monaco', 'Menlo', 'Courier New', monospace;
+        }}
+        pre {{
+            background-color: #f8f8f8;
+            border: 1px solid #ddd;
+            border-radius: 5px;
+            padding: 15px;
+            overflow-x: auto;
+        }}
+        pre code {{
+            background-color: transparent;
+            padding: 0;
+        }}
+        hr {{
+            border: none;
+            border-top: 1px solid #ddd;
+            margin: 30px 0;
+        }}
+        .metadata {{
+            background-color: #ecf0f1;
+            padding: 15px;
+            border-radius: 5px;
+            margin-bottom: 20px;
+        }}
+        .task {{
+            background-color: #e8f4f8;
+            padding: 10px;
+            border-left: 4px solid #3498db;
+            margin: 20px 0;
+        }}
+        .footer {{
+            margin-top: 50px;
+            text-align: center;
+            color: #7f8c8d;
+            font-size: 0.9em;
+        }}
+    </style>
+</head>
+<body>
+    <div class="content">
+        {markdown_to_html_simple(markdown_content)}
+    </div>
+    <div class="footer">
+        <p>Generated with Biomni | Stanford SNAP Lab</p>
+        <p><a href="https://github.com/snap-stanford/biomni">github.com/snap-stanford/biomni</a></p>
+    </div>
+</body>
+</html>
+"""
+    return html_template
+
+
+def markdown_to_html_simple(md: str) -> str:
+    """Simple markdown to HTML converter (basic implementation)."""
+    lines = md.split('\n')
+    html_lines = []
+    in_code_block = False
+    in_list = False
+
+    for line in lines:
+        # Code blocks
+        if line.startswith('```'):
+            if in_code_block:
+                html_lines.append('</code></pre>')
+                in_code_block = False
+            else:
+                lang = line[3:].strip()
+                html_lines.append(f'<pre><code class="language-{lang}">')
+                in_code_block = True
+            continue
+
+        if in_code_block:
+            html_lines.append(line)
+            continue
+
+        # Headers
+        if line.startswith('# '):
+            html_lines.append(f'<h1>{line[2:]}</h1>')
+        elif line.startswith('## '):
+            html_lines.append(f'<h2>{line[3:]}</h2>')
+        elif line.startswith('### '):
+            html_lines.append(f'<h3>{line[4:]}</h3>')
+        # Lists
+        elif line.startswith('- '):
+            if not in_list:
+                html_lines.append('<ul>')
+                in_list = True
+            html_lines.append(f'<li>{line[2:]}</li>')
+        else:
+            if in_list:
+                html_lines.append('</ul>')
+                in_list = False
+
+            # Horizontal rule
+            if line.strip() == '---':
+                html_lines.append('<hr>')
+            # Bold
+            elif '**' in line:
+                line = line.replace('**', '<strong>', 1).replace('**', '</strong>', 1)
+                html_lines.append(f'<p>{line}</p>')
+            # Regular paragraph
+            elif line.strip():
+                html_lines.append(f'<p>{line}</p>')
+            else:
+                html_lines.append('<br>')
+
+    if in_list:
+        html_lines.append('</ul>')
+
+    return '\n'.join(html_lines)
+
+
+def generate_report(
+    conversation_data: Dict[str, Any],
+    output_path: Path,
+    format: str = 'markdown',
+    title: Optional[str] = None
+):
+    """
+    Generate formatted report from conversation data.
+
+    Args:
+        conversation_data: Conversation history dictionary
+        output_path: Output file path
+        format: Output format ('markdown', 'html', or 'pdf')
+        title: Report title
+    """
+    messages = conversation_data.get('messages', [])
+
+    if not title:
+        title = f"Biomni Analysis - {datetime.now().strftime('%Y-%m-%d')}"

    # Generate markdown
-    md_path = output_path.replace('.pdf', '.md')
-    generate_markdown_report(
-        title="Biomni Analysis Report",
-        sections=sections,
-        metadata=metadata,
-        output_path=md_path
-    )
+    markdown_content = format_conversation_history(messages)
+
+    if format == 'markdown':
+        output_path.write_text(markdown_content)
+        print(f"✓ Markdown report saved to {output_path}")
+
+    elif format == 'html':
+        html_content = markdown_to_html(markdown_content, title)
+        output_path.write_text(html_content)
+        print(f"✓ HTML report saved to {output_path}")
+
+    elif format == 'pdf':
+        # For PDF generation, we'd typically use a library like weasyprint or reportlab
+        # This is a placeholder implementation
+        print("PDF generation requires additional dependencies (weasyprint or reportlab)")
+        print("Falling back to HTML format...")
+
+        html_path = output_path.with_suffix('.html')
+        html_content = markdown_to_html(markdown_content, title)
+        html_path.write_text(html_content)
+
+        print(f"✓ HTML report saved to {html_path}")
+        print("  To convert to PDF:")
+        print(f"    1. Install weasyprint: pip install weasyprint")
+        print(f"    2. Run: weasyprint {html_path} {output_path}")

-    # Convert to PDF
-    if method == 'weasyprint':
-        success = convert_to_pdf_weasyprint(md_path, output_path)
-    elif method == 'pandoc':
-        success = convert_to_pdf_pandoc(md_path, output_path)
    else:
-        print(f"Unknown method: {method}")
-        return False
-
-    if success:
-        print(f"✓ Report generated: {output_path}")
-        print(f"  Markdown: {md_path}")
-    else:
-        print("✗ Failed to generate PDF")
-        print(f"  Markdown available: {md_path}")
-
-    return success
+        raise ValueError(f"Unsupported format: {format}")


 def main():
-    """CLI for report generation."""
+    """Main entry point for CLI usage."""
    parser = argparse.ArgumentParser(
-        description='Generate formatted PDF reports for Biomni analyses'
+        description="Generate enhanced reports from biomni conversation histories"
    )

    parser.add_argument(
-        'input',
-        type=str,
-        help='Input markdown file or conversation history'
+        '--input',
+        type=Path,
+        required=True,
+        help='Input conversation history JSON file'
    )

    parser.add_argument(
-        '-o', '--output',
-        type=str,
-        default='biomni_report.pdf',
-        help='Output PDF path (default: biomni_report.pdf)'
+        '--output',
+        type=Path,
+        required=True,
+        help='Output report file path'
    )

    parser.add_argument(
-        '-m', '--method',
-        type=str,
-        choices=['weasyprint', 'pandoc'],
-        default='weasyprint',
-        help='Conversion method (default: weasyprint)'
+        '--format',
+        choices=['markdown', 'html', 'pdf'],
+        default='markdown',
+        help='Output format (default: markdown)'
    )

    parser.add_argument(
-        '--css',
+        '--title',
        type=str,
-        help='Custom CSS stylesheet path'
+        help='Report title (optional)'
    )

    args = parser.parse_args()

-    # Check if input is markdown or conversation history
-    input_path = Path(args.input)
-
-    if not input_path.exists():
-        print(f"Error: Input file not found: {args.input}")
-        return 1
-
-    # If input is markdown, convert directly
-    if input_path.suffix == '.md':
-        if args.method == 'weasyprint':
-            success = convert_to_pdf_weasyprint(
-                str(input_path),
-                args.output,
-                args.css
-            )
-        else:
-            success = convert_to_pdf_pandoc(str(input_path), args.output)
-
-        return 0 if success else 1
-
-    # Otherwise, assume it's conversation history (JSON)
+    # Load conversation data
    try:
-        import json
-        with open(input_path) as f:
-            history = json.load(f)
-
-        success = create_biomni_report(
-            history,
-            args.output,
-            args.method
-        )
-
-        return 0 if success else 1
-
+        with open(args.input, 'r') as f:
+            conversation_data = json.load(f)
+    except FileNotFoundError:
+        print(f"❌ Input file not found: {args.input}")
+        return 1
    except json.JSONDecodeError:
-        print("Error: Input file is not valid JSON or markdown")
+        print(f"❌ Invalid JSON in input file: {args.input}")
+        return 1
+
+    # Generate report
+    try:
+        generate_report(
+            conversation_data,
+            args.output,
+            format=args.format,
+            title=args.title
+        )
+        return 0
+    except Exception as e:
+        print(f"❌ Error generating report: {e}")
        return 1


-if __name__ == "__main__":
+if __name__ == '__main__':
+    import sys
    sys.exit(main())
--- a/scientific-packages/biomni/scripts/setup_environment.py
+++ b/scientific-packages/biomni/scripts/setup_environment.py
@@ -1,230 +1,355 @@
 #!/usr/bin/env python3
 """
-Biomni Environment Setup and Validation Script
+Interactive setup script for biomni environment configuration.

-This script helps users set up and validate their Biomni environment,
-including checking dependencies, API keys, and data availability.
+This script helps users set up:
+1. Conda environment with required dependencies
+2. API keys for LLM providers
+3. Data lake directory configuration
+4. MCP server setup (optional)
+
+Usage:
+    python setup_environment.py
 """

 import os
 import sys
 import subprocess
 from pathlib import Path
-from typing import Dict, List, Tuple
+from typing import Dict, Optional


-def check_python_version() -> Tuple[bool, str]:
-    """Check if Python version is compatible."""
-    version = sys.version_info
-    if version.major == 3 and version.minor >= 8:
-        return True, f"Python {version.major}.{version.minor}.{version.micro} ✓"
-    else:
-        return False, f"Python {version.major}.{version.minor} - requires Python 3.8+"
-
-
-def check_conda_env() -> Tuple[bool, str]:
-    """Check if running in biomni conda environment."""
-    conda_env = os.environ.get('CONDA_DEFAULT_ENV', None)
-    if conda_env == 'biomni_e1':
-        return True, f"Conda environment: {conda_env} ✓"
-    else:
-        return False, f"Not in biomni_e1 environment (current: {conda_env})"
-
-
-def check_package_installed(package: str) -> bool:
-    """Check if a Python package is installed."""
+def check_conda_installed() -> bool:
+    """Check if conda is available in the system."""
    try:
-        __import__(package)
+        subprocess.run(
+            ['conda', '--version'],
+            capture_output=True,
+            check=True
+        )
        return True
-    except ImportError:
+    except (subprocess.CalledProcessError, FileNotFoundError):
        return False


-def check_dependencies() -> Tuple[bool, List[str]]:
-    """Check for required and optional dependencies."""
-    required = ['biomni']
-    optional = ['weasyprint', 'markdown2pdf']
+def setup_conda_environment():
+    """Guide user through conda environment setup."""
+    print("\n=== Conda Environment Setup ===")

-    missing_required = [pkg for pkg in required if not check_package_installed(pkg)]
-    missing_optional = [pkg for pkg in optional if not check_package_installed(pkg)]
+    if not check_conda_installed():
+        print("❌ Conda not found. Please install Miniconda or Anaconda:")
+        print("   https://docs.conda.io/en/latest/miniconda.html")
+        return False

-    messages = []
-    success = len(missing_required) == 0
+    print("✓ Conda is installed")

-    if missing_required:
-        messages.append(f"Missing required packages: {', '.join(missing_required)}")
-        messages.append("Install with: pip install biomni --upgrade")
+    # Check if biomni_e1 environment exists
+    result = subprocess.run(
+        ['conda', 'env', 'list'],
+        capture_output=True,
+        text=True
+    )
+
+    if 'biomni_e1' in result.stdout:
+        print("✓ biomni_e1 environment already exists")
+        return True
+
+    print("\nCreating biomni_e1 conda environment...")
+    print("This will install Python 3.10 and required dependencies.")
+
+    response = input("Proceed? [y/N]: ").strip().lower()
+    if response != 'y':
+        print("Skipping conda environment setup")
+        return False
+
+    try:
+        # Create conda environment
+        subprocess.run(
+            ['conda', 'create', '-n', 'biomni_e1', 'python=3.10', '-y'],
+            check=True
+        )
+
+        print("\n✓ Conda environment created successfully")
+        print("\nTo activate: conda activate biomni_e1")
+        print("Then install biomni: pip install biomni --upgrade")
+        return True
+
+    except subprocess.CalledProcessError as e:
+        print(f"❌ Failed to create conda environment: {e}")
+        return False
+
+
+def setup_api_keys() -> Dict[str, str]:
+    """Interactive API key configuration."""
+    print("\n=== API Key Configuration ===")
+    print("Biomni supports multiple LLM providers.")
+    print("At minimum, configure one provider.")
+
+    api_keys = {}
+
+    # Anthropic (recommended)
+    print("\n1. Anthropic Claude (Recommended)")
+    print("   Get your API key from: https://console.anthropic.com/")
+    anthropic_key = input("   Enter ANTHROPIC_API_KEY (or press Enter to skip): ").strip()
+    if anthropic_key:
+        api_keys['ANTHROPIC_API_KEY'] = anthropic_key
+
+    # OpenAI
+    print("\n2. OpenAI")
+    print("   Get your API key from: https://platform.openai.com/api-keys")
+    openai_key = input("   Enter OPENAI_API_KEY (or press Enter to skip): ").strip()
+    if openai_key:
+        api_keys['OPENAI_API_KEY'] = openai_key
+
+    # Google Gemini
+    print("\n3. Google Gemini")
+    print("   Get your API key from: https://makersuite.google.com/app/apikey")
+    google_key = input("   Enter GOOGLE_API_KEY (or press Enter to skip): ").strip()
+    if google_key:
+        api_keys['GOOGLE_API_KEY'] = google_key
+
+    # Groq
+    print("\n4. Groq")
+    print("   Get your API key from: https://console.groq.com/keys")
+    groq_key = input("   Enter GROQ_API_KEY (or press Enter to skip): ").strip()
+    if groq_key:
+        api_keys['GROQ_API_KEY'] = groq_key
+
+    if not api_keys:
+        print("\n⚠️  No API keys configured. You'll need at least one to use biomni.")
+        return {}
+
+    return api_keys
+
+
+def save_api_keys(api_keys: Dict[str, str], method: str = 'env_file'):
+    """Save API keys using specified method."""
+    if method == 'env_file':
+        env_file = Path.cwd() / '.env'
+
+        # Read existing .env if present
+        existing_vars = {}
+        if env_file.exists():
+            with open(env_file, 'r') as f:
+                for line in f:
+                    line = line.strip()
+                    if line and not line.startswith('#'):
+                        if '=' in line:
+                            key, val = line.split('=', 1)
+                            existing_vars[key.strip()] = val.strip()
+
+        # Update with new keys
+        existing_vars.update(api_keys)
+
+        # Write to .env
+        with open(env_file, 'w') as f:
+            f.write("# Biomni API Keys\n")
+            f.write(f"# Generated by setup_environment.py\n\n")
+            for key, value in existing_vars.items():
+                f.write(f"{key}={value}\n")
+
+        print(f"\n✓ API keys saved to {env_file}")
+        print("  Keys will be loaded automatically when biomni runs in this directory")
+
+    elif method == 'shell_export':
+        shell_file = Path.home() / '.bashrc'  # or .zshrc for zsh users
+
+        print("\n📋 Add these lines to your shell configuration:")
+        for key, value in api_keys.items():
+            print(f"   export {key}=\"{value}\"")
+
+        print(f"\nThen run: source {shell_file}")
+
+
+def setup_data_directory() -> Optional[Path]:
+    """Configure biomni data lake directory."""
+    print("\n=== Data Lake Configuration ===")
+    print("Biomni requires ~11GB for integrated biomedical databases.")
+
+    default_path = Path.cwd() / 'biomni_data'
+    print(f"\nDefault location: {default_path}")
+
+    response = input("Use default location? [Y/n]: ").strip().lower()
+
+    if response == 'n':
+        custom_path = input("Enter custom path: ").strip()
+        data_path = Path(custom_path).expanduser().resolve()
    else:
-        messages.append("Required packages: ✓")
+        data_path = default_path

-    if missing_optional:
-        messages.append(f"Missing optional packages: {', '.join(missing_optional)}")
-        messages.append("For PDF reports, install: pip install weasyprint")
+    # Create directory if it doesn't exist
+    data_path.mkdir(parents=True, exist_ok=True)

-    return success, messages
+    print(f"\n✓ Data directory configured: {data_path}")
+    print("  Data will be downloaded automatically on first use")
+
+    return data_path


-def check_api_keys() -> Tuple[bool, Dict[str, bool]]:
-    """Check which API keys are configured."""
-    api_keys = {
-        'ANTHROPIC_API_KEY': os.environ.get('ANTHROPIC_API_KEY'),
-        'OPENAI_API_KEY': os.environ.get('OPENAI_API_KEY'),
-        'GEMINI_API_KEY': os.environ.get('GEMINI_API_KEY'),
-        'GROQ_API_KEY': os.environ.get('GROQ_API_KEY'),
-    }
+def test_installation(data_path: Path):
+    """Test biomni installation with a simple query."""
+    print("\n=== Installation Test ===")
+    print("Testing biomni installation with a simple query...")

-    configured = {key: bool(value) for key, value in api_keys.items()}
-    has_any = any(configured.values())
+    response = input("Run test? [Y/n]: ").strip().lower()
+    if response == 'n':
+        print("Skipping test")
+        return

-    return has_any, configured
+    test_code = f'''
+import os
+from biomni.agent import A1

+# Use environment variables for API keys
+agent = A1(path='{data_path}', llm='claude-sonnet-4-20250514')

-def check_data_directory(data_path: str = './data') -> Tuple[bool, str]:
-    """Check if Biomni data directory exists and has content."""
-    path = Path(data_path)
+# Simple test query
+result = agent.go("What is the primary function of the TP53 gene?")
+print("Test result:", result)
+'''

-    if not path.exists():
-        return False, f"Data directory not found at {data_path}"
+    test_file = Path('test_biomni.py')
+    with open(test_file, 'w') as f:
+        f.write(test_code)

-    # Check if directory has files (data has been downloaded)
-    files = list(path.glob('*'))
-    if len(files) == 0:
-        return False, f"Data directory exists but is empty. Run agent once to download."
+    print(f"\nTest script created: {test_file}")
+    print("Running test...")

-    # Rough size check (should be ~11GB)
-    total_size = sum(f.stat().st_size for f in path.rglob('*') if f.is_file())
-    size_gb = total_size / (1024**3)
-
-    if size_gb < 1:
-        return False, f"Data directory exists but seems incomplete ({size_gb:.1f} GB)"
-
-    return True, f"Data directory: {data_path} ({size_gb:.1f} GB) ✓"
-
-
-def check_disk_space(required_gb: float = 20) -> Tuple[bool, str]:
-    """Check if sufficient disk space is available."""
    try:
-        import shutil
-        stat = shutil.disk_usage('.')
-        free_gb = stat.free / (1024**3)
-
-        if free_gb >= required_gb:
-            return True, f"Disk space: {free_gb:.1f} GB available ✓"
-        else:
-            return False, f"Low disk space: {free_gb:.1f} GB (need {required_gb} GB)"
-    except Exception as e:
-        return False, f"Could not check disk space: {e}"
+        subprocess.run([sys.executable, str(test_file)], check=True)
+        print("\n✓ Test completed successfully!")
+        test_file.unlink()  # Clean up test file
+    except subprocess.CalledProcessError:
+        print("\n❌ Test failed. Check your configuration.")
+        print(f"   Test script saved as {test_file} for debugging")


-def test_biomni_import() -> Tuple[bool, str]:
-    """Test if Biomni can be imported and initialized."""
-    try:
-        from biomni.agent import A1
-        from biomni.config import default_config
-        return True, "Biomni import successful ✓"
-    except ImportError as e:
-        return False, f"Cannot import Biomni: {e}"
-    except Exception as e:
-        return False, f"Biomni import error: {e}"
+def generate_example_script(data_path: Path):
+    """Generate example usage script."""
+    example_code = f'''#!/usr/bin/env python3
+"""
+Example biomni usage script

+This demonstrates basic biomni usage patterns.
+Modify this script for your research tasks.
+"""

-def suggest_fixes(results: Dict[str, Tuple[bool, any]]) -> List[str]:
-    """Generate suggestions for fixing issues."""
-    suggestions = []
+from biomni.agent import A1

-    if not results['python'][0]:
-        suggestions.append("➜ Upgrade Python to 3.8 or higher")
+# Initialize agent
+agent = A1(
+    path='{data_path}',
+    llm='claude-sonnet-4-20250514'  # or your preferred LLM
+)

-    if not results['conda'][0]:
-        suggestions.append("➜ Activate biomni environment: conda activate biomni_e1")
+# Example 1: Simple gene query
+print("Example 1: Gene function query")
+result = agent.go("""
+What are the main functions of the BRCA1 gene?
+Include information about:
+- Molecular function
+- Associated diseases
+- Protein interactions
+""")
+print(result)
+print("-" * 80)

-    if not results['dependencies'][0]:
-        suggestions.append("➜ Install Biomni: pip install biomni --upgrade")
+# Example 2: Data analysis
+print("\\nExample 2: GWAS analysis")
+result = agent.go("""
+Explain how to analyze GWAS summary statistics for:
+1. Identifying genome-wide significant variants
+2. Mapping variants to genes
+3. Pathway enrichment analysis
+""")
+print(result)

-    if not results['api_keys'][0]:
-        suggestions.append("➜ Set API key: export ANTHROPIC_API_KEY='your-key'")
-        suggestions.append("   Or create .env file with API keys")
+# Save conversation history
+agent.save_conversation_history("example_results.pdf")
+print("\\nResults saved to example_results.pdf")
+'''

-    if not results['data'][0]:
-        suggestions.append("➜ Data will auto-download on first agent.go() call")
+    example_file = Path('example_biomni_usage.py')
+    with open(example_file, 'w') as f:
+        f.write(example_code)

-    if not results['disk_space'][0]:
-        suggestions.append("➜ Free up disk space (need ~20GB total)")
-
-    return suggestions
+    print(f"\n✓ Example script created: {example_file}")


 def main():
-    """Run all environment checks and display results."""
+    """Main setup workflow."""
    print("=" * 60)
-    print("Biomni Environment Validation")
+    print("Biomni Environment Setup")
    print("=" * 60)
-    print()

-    # Run all checks
-    results = {}
+    # Step 1: Conda environment
+    conda_success = setup_conda_environment()

-    print("Checking Python version...")
-    results['python'] = check_python_version()
-    print(f"  {results['python'][1]}")
-    print()
+    if conda_success:
+        print("\n⚠️  Remember to activate the environment:")
+        print("   conda activate biomni_e1")
+        print("   pip install biomni --upgrade")

-    print("Checking conda environment...")
-    results['conda'] = check_conda_env()
-    print(f"  {results['conda'][1]}")
-    print()
+    # Step 2: API keys
+    api_keys = setup_api_keys()

-    print("Checking dependencies...")
-    results['dependencies'] = check_dependencies()
-    for msg in results['dependencies'][1]:
-        print(f"  {msg}")
-    print()
+    if api_keys:
+        print("\nHow would you like to store API keys?")
+        print("1. .env file (recommended, local to this directory)")
+        print("2. Shell export (add to .bashrc/.zshrc)")

-    print("Checking API keys...")
-    results['api_keys'] = check_api_keys()
-    has_keys, key_status = results['api_keys']
-    for key, configured in key_status.items():
-        status = "✓" if configured else "✗"
-        print(f"  {key}: {status}")
-    print()
+        choice = input("Choose [1/2]: ").strip()

-    print("Checking Biomni data directory...")
-    results['data'] = check_data_directory()
-    print(f"  {results['data'][1]}")
-    print()
+        if choice == '2':
+            save_api_keys(api_keys, method='shell_export')
+        else:
+            save_api_keys(api_keys, method='env_file')

-    print("Checking disk space...")
-    results['disk_space'] = check_disk_space()
-    print(f"  {results['disk_space'][1]}")
-    print()
+    # Step 3: Data directory
+    data_path = setup_data_directory()

-    print("Testing Biomni import...")
-    results['biomni_import'] = test_biomni_import()
-    print(f"  {results['biomni_import'][1]}")
-    print()
+    # Step 4: Generate example script
+    if data_path:
+        generate_example_script(data_path)
+
+    # Step 5: Test installation (optional)
+    if api_keys and data_path:
+        test_installation(data_path)

    # Summary
+    print("\n" + "=" * 60)
+    print("Setup Complete!")
    print("=" * 60)
-    all_passed = all(result[0] for result in results.values())

-    if all_passed:
-        print("✓ All checks passed! Environment is ready.")
-        print()
-        print("Quick start:")
-        print("  from biomni.agent import A1")
-        print("  agent = A1(path='./data', llm='claude-sonnet-4-20250514')")
-        print("  agent.go('Your biomedical task')")
+    if conda_success:
+        print("✓ Conda environment: biomni_e1")
+
+    if api_keys:
+        print(f"✓ API keys configured: {', '.join(api_keys.keys())}")
+
+    if data_path:
+        print(f"✓ Data directory: {data_path}")
+
+    print("\nNext steps:")
+    if conda_success:
+        print("1. conda activate biomni_e1")
+        print("2. pip install biomni --upgrade")
+        print("3. Run example_biomni_usage.py to test")
    else:
-        print("⚠ Some checks failed. See suggestions below:")
-        print()
-        suggestions = suggest_fixes(results)
-        for suggestion in suggestions:
-            print(suggestion)
+        print("1. Install conda/miniconda")
+        print("2. Run this script again")

-    print("=" * 60)
-
-    return 0 if all_passed else 1
+    print("\nFor documentation, see:")
+    print("  - GitHub: https://github.com/snap-stanford/biomni")
+    print("  - Paper: https://www.biorxiv.org/content/10.1101/2025.05.30.656746v1")


 if __name__ == "__main__":
-    sys.exit(main())
+    try:
+        main()
+    except KeyboardInterrupt:
+        print("\n\nSetup interrupted by user")
+        sys.exit(1)
+    except Exception as e:
+        print(f"\n❌ Error during setup: {e}")
+        sys.exit(1)