Improved Biomni support

2026-01-26 16:58:56 +08:00 · 2025-10-22 08:38:06 -07:00
parent 71a3c3750f
commit 77822efeed
9 changed files with 2512 additions and 3393 deletions
--- a/.claude-plugin/marketplace.json
+++ b/.claude-plugin/marketplace.json
@@ -7,7 +7,7 @@
  },
  "metadata": {
    "description": "Claude scientific skills from K-Dense Inc",
-    "version": "1.50.0"
+    "version": "1.51.0"
  },
  "plugins": [
    {
--- a/docs/scientific-packages.md
+++ b/docs/scientific-packages.md
@@ -56,7 +56,7 @@
 - **scikit-bio** - Bioinformatics sequence analysis and diversity metrics
 - **Zarr** - Chunked, compressed N-dimensional array storage
-## Multi-omics & Integration
+## Multi-omics & AI Agent Frameworks
- **BIOMNI** - Multi-omics data integration with LLM-powered analysis
+- **BIOMNI** - Autonomous biomedical AI agent framework from Stanford SNAP lab for executing complex research tasks across genomics, drug discovery, molecular biology, and clinical analysis. Combines LLM reasoning with code execution and ~11GB of integrated biomedical databases (Ensembl, NCBI Gene, UniProt, PDB, AlphaFold, ClinVar, OMIM, HPO, PubMed, KEGG, Reactome, GO). Supports multiple LLM providers (Claude, GPT-4, Gemini, Groq, Bedrock). Includes A1 agent class for autonomous task decomposition, BiomniEval1 benchmark framework, and MCP server integration. Use cases: CRISPR screening design, single-cell RNA-seq analysis, ADMET prediction, GWAS interpretation, rare disease diagnosis, protein structure analysis, literature synthesis, and multi-omics integration
--- a/scientific-packages/biomni/SKILL.md
+++ b/scientific-packages/biomni/SKILL.md
@@ -1,375 +1,316 @@
 ---
 name: biomni
-description: "AI agent for autonomous biomedical task execution. CRISPR design, scRNA-seq, ADMET, GWAS, structure prediction, multi-omics, with automated planning/code generation, for complex workflows."
+description: Autonomous biomedical AI agent framework for executing complex research tasks across genomics, drug discovery, molecular biology, and clinical analysis. Use this skill when conducting multi-step biomedical research including CRISPR screening design, single-cell RNA-seq analysis, ADMET prediction, GWAS interpretation, rare disease diagnosis, or lab protocol optimization. Leverages LLM reasoning with code execution and integrated biomedical databases.
 ---
 # Biomni
 ## Overview
-Biomni is a general-purpose biomedical AI agent that autonomously executes research tasks across diverse biomedical subfields. Use Biomni to combine large language model reasoning with retrieval-augmented planning and code-based execution for scientific productivity and hypothesis generation. The system operates with an ~11GB biomedical knowledge base covering molecular, genomic, and clinical domains.
+Biomni is an open-source biomedical AI agent framework from Stanford's SNAP lab that autonomously executes complex research tasks across biomedical domains. Use this skill when working on multi-step biological reasoning tasks, analyzing biomedical data, or conducting research spanning genomics, drug discovery, molecular biology, and clinical analysis.
 ## Core Capabilities
 Biomni excels at:
 1. **Multi-step biological reasoning** - Autonomous task decomposition and planning for complex biomedical queries
 2. **Code generation and execution** - Dynamic analysis pipeline creation for data processing
 3. **Knowledge retrieval** - Access to ~11GB of integrated biomedical databases and literature
 4. **Cross-domain problem solving** - Unified interface for genomics, proteomics, drug discovery, and clinical tasks
 ## When to Use This Skill
 Use biomni for:
 - **CRISPR screening** - Design screens, prioritize genes, analyze knockout effects
 - **Single-cell RNA-seq** - Cell type annotation, differential expression, trajectory analysis
 - **Drug discovery** - ADMET prediction, target identification, compound optimization
 - **GWAS analysis** - Variant interpretation, causal gene identification, pathway enrichment
 - **Clinical genomics** - Rare disease diagnosis, variant pathogenicity, phenotype-genotype mapping
 - **Lab protocols** - Protocol optimization, literature synthesis, experimental design
 ## Quick Start
-Initialize and use the Biomni agent with these basic steps:
+### Installation and Setup
 Biomni requires conda environment setup and API keys for LLM providers:
 ```bash
 # Clone repository and set up environment
 git clone https://github.com/snap-stanford/biomni
 cd biomni
 bash setup.sh
 # Or install via pip
 conda activate biomni_e1
 pip install biomni --upgrade
 ```
 Configure API keys (store in `.env` file or environment variables):
 ```bash
 export ANTHROPIC_API_KEY="your-key-here"
 # Optional: OpenAI, Azure, Google, Groq, AWS Bedrock keys
 ```
 Use `scripts/setup_environment.py` for interactive setup assistance.
 ### Basic Usage Pattern
 ```python
 from biomni.agent import A1
-# Initialize agent with data path and LLM model
+# Initialize agent with data path and LLM choice
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
-# Execute a biomedical research task
+# Execute biomedical task autonomously
-agent.go("Your biomedical task description")
+agent.go("Your biomedical research question or task")
 # Save conversation history and results
 agent.save_conversation_history("report.pdf")
 ```
-The agent will autonomously decompose the task, retrieve relevant biomedical knowledge, generate and execute code, and provide results.
+## Working with Biomni
-## Installation and Setup
+### 1. Agent Initialization
-### Environment Preparation
+The A1 class is the primary interface for biomni:
 1. **Set up the conda environment:**
   - Follow instructions in `biomni_env/README.md` from the repository
   - Activate the environment: `conda activate biomni_e1`
 2. **Install the package:**
   ```bash
   pip install biomni --upgrade
   ```
   Or install from source:
   ```bash
   git clone https://github.com/snap-stanford/biomni.git
   cd biomni
   pip install -e .
   ```
 3. **Configure API keys:**
   Set up credentials via environment variables or `.env` file:
   ```bash
   export ANTHROPIC_API_KEY="your-key-here"
   export OPENAI_API_KEY="your-key-here"  # Optional
   ```
 4. **Data initialization:**
   On first use, the agent will automatically download the ~11GB biomedical knowledge base.
 ### LLM Provider Configuration
 Biomni supports multiple LLM providers. Configure the default provider using:
 ```python
 from biomni.agent import A1
 from biomni.config import default_config
-# Set the default LLM model
+# Basic initialization
 default_config.llm = "claude-sonnet-4-20250514"  # Anthropic
 # default_config.llm = "gpt-4"  # OpenAI
 # default_config.llm = "azure/gpt-4"  # Azure OpenAI
 # default_config.llm = "gemini/gemini-pro"  # Google Gemini
 # Set timeout (optional)
 default_config.timeout_seconds = 1200
 # Set data path (optional)
 default_config.data_path = "./custom/data/path"
 ```
 Refer to `references/llm_providers.md` for detailed configuration options for each provider.
 ## Core Biomedical Research Tasks
 ### 1. CRISPR Screening and Design
 Execute CRISPR screening tasks including guide RNA design, off-target analysis, and screening experiment planning:
 ```python
 agent.go("Design a CRISPR screening experiment to identify genes involved in cancer cell resistance to drug X")
 ```
 The agent will:
 - Retrieve relevant gene databases
 - Design guide RNAs with specificity analysis
 - Plan experimental controls and readout strategies
 - Generate analysis code for screening results
 ### 2. Single-Cell RNA-seq Analysis
 Perform comprehensive scRNA-seq analysis workflows:
 ```python
 agent.go("Analyze this 10X Genomics scRNA-seq dataset, identify cell types, and find differentially expressed genes between clusters")
 ```
 Capabilities include:
 - Quality control and preprocessing
 - Dimensionality reduction and clustering
 - Cell type annotation using marker databases
 - Differential expression analysis
 - Pathway enrichment analysis
 ### 3. Molecular Property Prediction (ADMET)
 Predict absorption, distribution, metabolism, excretion, and toxicity properties:
 ```python
 agent.go("Predict ADMET properties for these drug candidates: [SMILES strings]")
 ```
 The agent handles:
 - Molecular descriptor calculation
 - Property prediction using integrated models
 - Toxicity screening
 - Drug-likeness assessment
 ### 4. Genomic Analysis
 Execute genomic data analysis tasks:
 ```python
 agent.go("Perform GWAS analysis to identify SNPs associated with disease phenotype in this cohort")
 ```
 Supports:
 - Genome-wide association studies (GWAS)
 - Variant calling and annotation
 - Population genetics analysis
 - Functional genomics integration
 ### 5. Protein Structure and Function
 Analyze protein sequences and structures:
 ```python
 agent.go("Predict the structure of this protein sequence and identify potential binding sites")
 ```
 Capabilities:
 - Sequence analysis and domain identification
 - Structure prediction integration
 - Binding site prediction
 - Protein-protein interaction analysis
 ### 6. Disease Diagnosis and Classification
 Perform disease classification from multi-omics data:
 ```python
 agent.go("Build a classifier to diagnose disease X from patient RNA-seq and clinical data")
 ```
 ### 7. Systems Biology and Pathway Analysis
 Analyze biological pathways and networks:
 ```python
 agent.go("Identify dysregulated pathways in this differential expression dataset")
 ```
 ### 8. Drug Discovery and Repurposing
 Support drug discovery workflows:
 ```python
 agent.go("Identify FDA-approved drugs that could be repurposed for treating disease Y based on mechanism of action")
 ```
 ## Advanced Features
 ### Custom Configuration per Agent
 Override global configuration for specific agent instances:
 ```python
 agent = A1(
-    path='./project_data',
+    path='./data',  # Path to data lake (~11GB downloaded on first use)
-    llm='gpt-4o',
+    llm='claude-sonnet-4-20250514'  # LLM model selection
    timeout=1800
 )
 # Advanced configuration
 default_config.llm = "gpt-4"
 default_config.timeout_seconds = 1200
 default_config.max_iterations = 50
 ```
-### Conversation History and Reporting
+**Supported LLM Providers:**
 - Anthropic Claude (recommended): `claude-sonnet-4-20250514`, `claude-opus-4-20250514`
 - OpenAI: `gpt-4`, `gpt-4-turbo`
 - Azure OpenAI: via Azure configuration
 - Google Gemini: `gemini-2.0-flash-exp`
 - Groq: `llama-3.3-70b-versatile`
 - AWS Bedrock: Various models via Bedrock API
-Save execution traces as formatted PDF reports:
+See `references/llm_providers.md` for detailed LLM configuration instructions.
 ### 2. Task Execution Workflow
 Biomni follows an autonomous agent workflow:
 ```python
-# After executing tasks
+# Step 1: Initialize agent
-agent.save_conversation_history(
+agent = A1(path='./data', llm='claude-sonnet-4-20250514')
-    output_path='./reports/experiment_log.pdf',
+
-    format='pdf'
+# Step 2: Execute task with natural language query
 result = agent.go("""
 Design a CRISPR screen to identify genes regulating autophagy in
 HEK293 cells. Prioritize genes based on essentiality and pathway
 relevance.
 """)
 # Step 3: Review generated code and analysis
 # Agent autonomously:
 # - Decomposes task into sub-steps
 # - Retrieves relevant biological knowledge
 # - Generates and executes analysis code
 # - Interprets results and provides insights
 # Step 4: Save results
 agent.save_conversation_history("autophagy_screen_report.pdf")
 ```
 ### 3. Common Task Patterns
 #### CRISPR Screening Design
 ```python
 agent.go("""
 Design a genome-wide CRISPR knockout screen for identifying genes
 affecting [phenotype] in [cell type]. Include:
 1. sgRNA library design
 2. Gene prioritization criteria
 3. Expected hit genes based on pathway analysis
 """)
 ```
 #### Single-Cell RNA-seq Analysis
 ```python
 agent.go("""
 Analyze this single-cell RNA-seq dataset:
 - Perform quality control and filtering
 - Identify cell populations via clustering
 - Annotate cell types using marker genes
 - Conduct differential expression between conditions
 File path: [path/to/data.h5ad]
 """)
 ```
 #### Drug ADMET Prediction
 ```python
 agent.go("""
 Predict ADMET properties for these drug candidates:
 [SMILES strings or compound IDs]
 Focus on:
 - Absorption (Caco-2 permeability, HIA)
 - Distribution (plasma protein binding, BBB penetration)
 - Metabolism (CYP450 interaction)
 - Excretion (clearance)
 - Toxicity (hERG liability, hepatotoxicity)
 """)
 ```
 #### GWAS Variant Interpretation
 ```python
 agent.go("""
 Interpret GWAS results for [trait/disease]:
 - Identify genome-wide significant variants
 - Map variants to causal genes
 - Perform pathway enrichment analysis
 - Predict functional consequences
 Summary statistics file: [path/to/gwas_summary.txt]
 """)
 ```
 See `references/use_cases.md` for comprehensive task examples across all biomedical domains.
 ### 4. Data Integration
 Biomni integrates ~11GB of biomedical knowledge sources:
 - **Gene databases** - Ensembl, NCBI Gene, UniProt
 - **Protein structures** - PDB, AlphaFold
 - **Clinical datasets** - ClinVar, OMIM, HPO
 - **Literature indices** - PubMed abstracts, biomedical ontologies
 - **Pathway databases** - KEGG, Reactome, GO
 Data is automatically downloaded to the specified `path` on first use.
 ### 5. MCP Server Integration
 Extend biomni with external tools via Model Context Protocol:
 ```python
 # MCP servers can provide:
 # - FDA drug databases
 # - Web search for literature
 # - Custom biomedical APIs
 # - Laboratory equipment interfaces
 # Configure MCP servers in .biomni/mcp_config.json
 ```
 ### 6. Evaluation Framework
 Benchmark agent performance on biomedical tasks:
 ```python
 from biomni.eval import BiomniEval1
 evaluator = BiomniEval1()
 # Evaluate on specific task types
 score = evaluator.evaluate(
    task_type='crispr_design',
    instance_id='test_001',
    answer=agent_output
 )
 # Access evaluation dataset
 dataset = evaluator.load_dataset()
 ```
 Requires one of: WeasyPrint, markdown2pdf, or Pandoc.
 ### Model Context Protocol (MCP) Integration
 Extend agent capabilities with external tools:
 ```python
 # Add MCP-compatible tools
 agent.add_mcp(config_path='./mcp_config.json')
 ```
 MCP enables integration with:
 - Laboratory information management systems (LIMS)
 - Specialized bioinformatics databases
 - Custom analysis pipelines
 - External computational resources
 ### Using Biomni-R0 (Specialized Reasoning Model)
 Deploy the 32B parameter Biomni-R0 model for enhanced biological reasoning:
 ```bash
 # Install SGLang
 pip install "sglang[all]"
 # Deploy Biomni-R0
 python -m sglang.launch_server \
    --model-path snap-stanford/biomni-r0 \
    --port 30000 \
    --trust-remote-code
 ```
 Then configure the agent:
 ```python
 from biomni.config import default_config
 default_config.llm = "openai/biomni-r0"
 default_config.api_base = "http://localhost:30000/v1"
 ```
 Biomni-R0 provides specialized reasoning for:
 - Complex multi-step biological workflows
 - Hypothesis generation and evaluation
 - Experimental design optimization
 - Literature-informed analysis
 ## Best Practices
-### Task Specification
+### Task Formulation
-
+- **Be specific** - Include biological context, organism, cell type, conditions
-Provide clear, specific task descriptions:
+- **Specify outputs** - Clearly state desired analysis outputs and formats
-
+- **Provide data paths** - Include file paths for datasets to analyze
-✅ **Good:** "Analyze this scRNA-seq dataset (file: data.h5ad) to identify T cell subtypes, then perform differential expression analysis comparing activated vs. resting T cells"
+- **Set constraints** - Mention time/computational limits if relevant
 ❌ **Vague:** "Analyze my RNA-seq data"
 ### Data Organization
 Structure data directories for efficient retrieval:
 ```
 project/
 ├── data/              # Biomni knowledge base
 ├── raw_data/          # Your experimental data
 ├── results/           # Analysis outputs
 └── reports/           # Generated reports
 ```
 ### Iterative Refinement
 Use iterative task execution for complex analyses:
 ```python
 # Step 1: Exploratory analysis
 agent.go("Load and perform initial QC on the proteomics dataset")
 # Step 2: Based on results, refine analysis
 agent.go("Based on the QC results, remove low-quality samples and normalize using method X")
 # Step 3: Downstream analysis
 agent.go("Perform differential abundance analysis with adjusted parameters")
 ```
 ### Security Considerations
 ⚠️ **Important**: Biomni executes LLM-generated code with full system privileges. For production use:
 - Run in isolated environments (Docker, VMs)
 - Avoid exposing sensitive credentials
 - Review generated code before execution in sensitive contexts
 - Use sandboxed execution environments when possible
-**CRITICAL:** Biomni executes LLM-generated code with full system privileges. For production use:
+### Performance Optimization
 - **Choose appropriate LLMs** - Claude Sonnet 4 recommended for balance of speed/quality
 - **Set reasonable timeouts** - Adjust `default_config.timeout_seconds` for complex tasks
 - **Monitor iterations** - Track `max_iterations` to prevent runaway loops
 - **Cache data** - Reuse downloaded data lake across sessions
-1. **Use sandboxed environments:** Deploy in Docker containers or VMs with restricted permissions
+### Result Documentation
-2. **Validate sensitive operations:** Review code before execution for file access, network calls, or credential usage
+```python
-3. **Limit data access:** Restrict agent access to only necessary data directories
+# Always save conversation history for reproducibility
-4. **Monitor execution:** Log all executed code for audit trails
+agent.save_conversation_history("results/project_name_YYYYMMDD.pdf")
-Never run Biomni with:
+# Include in reports:
- Unrestricted file system access
+# - Original task description
- Direct access to sensitive credentials
+# - Generated analysis code
- Network access to production systems
+# - Results and interpretations
- Elevated system privileges
+# - Data sources used
 ```
-### Model Selection Guidelines
+## Resources
-Choose models based on task complexity:
+### References
 Detailed documentation available in the `references/` directory:
- **Claude Sonnet 4:** Recommended for most biomedical tasks, excellent biological reasoning
+- **`api_reference.md`** - Complete API documentation for A1 class, configuration, and evaluation
- **GPT-4/GPT-4o:** Strong general capabilities, good for diverse tasks
+- **`llm_providers.md`** - LLM provider setup (Anthropic, OpenAI, Azure, Google, Groq, AWS)
- **Biomni-R0:** Specialized for complex biological reasoning, multi-step workflows
+- **`use_cases.md`** - Comprehensive task examples for all biomedical domains
 - **Smaller models:** Use for simple, well-defined tasks to reduce cost
-## Evaluation and Benchmarking
+### Scripts
 Helper scripts in the `scripts/` directory:
-Biomni-Eval1 benchmark contains 433 evaluation instances across 10 biological tasks:
+- **`setup_environment.py`** - Interactive environment and API key configuration
 - **`generate_report.py`** - Enhanced PDF report generation with custom formatting
- GWAS analysis
+### External Resources
- Disease diagnosis
+- **GitHub**: https://github.com/snap-stanford/biomni
- Gene detection and classification
+- **Web Platform**: https://biomni.stanford.edu
- Molecular property prediction
+- **Paper**: https://www.biorxiv.org/content/10.1101/2025.05.30.656746v1
- Pathway analysis
+- **Model**: https://huggingface.co/biomni/Biomni-R0-32B-Preview
- Protein function prediction
+- **Evaluation Dataset**: https://huggingface.co/datasets/biomni/Eval1
 - Drug response prediction
 - Variant interpretation
 - Cell type annotation
 - Biomarker discovery
 Use the benchmark to:
 - Evaluate custom agent configurations
 - Compare LLM providers for specific tasks
 - Validate analysis pipelines
 ## Troubleshooting
 ### Common Issues
-**Issue:** Data download fails or times out
+**Data download fails**
-**Solution:** Manually download the knowledge base or increase timeout settings
+```python
 # Manually trigger data lake download
 agent = A1(path='./data', llm='your-llm')
 # First .go() call will download data
 ```
-**Issue:** Package dependency conflicts
+**API key errors**
-**Solution:** Some optional dependencies cannot be installed by default due to conflicts. Install specific packages manually and uncomment relevant code sections as documented in the repository
+```bash
 # Verify environment variables
 echo $ANTHROPIC_API_KEY
 # Or check .env file in working directory
 ```
-**Issue:** LLM API errors
+**Timeout on complex tasks**
-**Solution:** Verify API key configuration, check rate limits, ensure sufficient credits
+```python
 from biomni.config import default_config
 default_config.timeout_seconds = 3600  # 1 hour
 ```
-**Issue:** Memory errors with large datasets
+**Memory issues with large datasets**
-**Solution:** Process data in chunks, use data subsampling, or deploy on higher-memory instances
+- Use streaming for large files
 - Process data in chunks
 - Increase system memory allocation
 ### Getting Help
-For detailed troubleshooting:
+For issues or questions:
- Review the Biomni GitHub repository issues
+- GitHub Issues: https://github.com/snap-stanford/biomni/issues
- Check `references/api_reference.md` for detailed API documentation
+- Documentation: Check `references/` files for detailed guidance
- Consult `references/task_examples.md` for comprehensive task patterns
+- Community: Stanford SNAP lab and biomni contributors
 ## Resources
 ### references/
 Detailed reference documentation for advanced usage:
 - **api_reference.md:** Complete API documentation for A1 agent, configuration objects, and utility functions
 - **llm_providers.md:** Comprehensive guide for configuring all supported LLM providers (Anthropic, OpenAI, Azure, Gemini, Groq, Ollama, AWS Bedrock)
 - **task_examples.md:** Extensive collection of biomedical task examples with code patterns
 ### scripts/
 Helper scripts for common operations:
 - **setup_environment.py:** Automated environment setup and validation
 - **generate_report.py:** Enhanced PDF report generation with custom formatting
 Load reference documentation as needed:
 ```python
 # Claude can read reference files when needed for detailed information
 # Example: "Check references/llm_providers.md for Azure OpenAI configuration"
 ```
--- a/scientific-packages/biomni/references/api_reference.md
+++ b/scientific-packages/biomni/references/api_reference.md
@@ -1,635 +1,460 @@
 # Biomni API Reference
-This document provides comprehensive API documentation for the Biomni biomedical AI agent system.
+Comprehensive API documentation for the biomni framework.
-## Core Classes
+## A1 Agent Class
-### A1 Agent
+The A1 class is the primary interface for interacting with biomni.
-The primary agent class for executing biomedical research tasks.
+### Initialization
 #### Initialization
 ```python
 from biomni.agent import A1
 agent = A1(
-    path='./data',              # Path to biomedical knowledge base
+    path: str,              # Path to data lake directory
-    llm='claude-sonnet-4-20250514',  # LLM model identifier
+    llm: str,               # LLM model identifier
-    timeout=None,               # Optional timeout in seconds
+    verbose: bool = True,   # Enable verbose logging
-    verbose=True               # Enable detailed logging
+    mcp_config: str = None  # Path to MCP server configuration
 )
 ```
 **Parameters:**
- `path` (str, required): Directory path where the biomedical knowledge base is stored or will be downloaded. First-time initialization will download ~11GB of data.
+- **`path`** (str, required) - Directory path for biomni data lake (~11GB). Data is automatically downloaded on first use if not present.
 - `llm` (str, optional): LLM model identifier. Defaults to the value in `default_config.llm`. Supports multiple providers (see LLM Providers section).
 - `timeout` (int, optional): Maximum execution time in seconds for agent operations. Overrides `default_config.timeout_seconds`.
 - `verbose` (bool, optional): Enable verbose logging for debugging. Default: True.
-**Returns:** A1 agent instance ready for task execution.
+- **`llm`** (str, required) - LLM model identifier. Options include:
  - `'claude-sonnet-4-20250514'` - Recommended for balanced performance
  - `'claude-opus-4-20250514'` - Maximum capability
  - `'gpt-4'`, `'gpt-4-turbo'` - OpenAI models
  - `'gemini-2.0-flash-exp'` - Google Gemini
  - `'llama-3.3-70b-versatile'` - Via Groq
  - Custom model endpoints via provider configuration
-#### Methods
+- **`verbose`** (bool, optional, default=True) - Enable detailed logging of agent reasoning, tool use, and code execution.
-##### `go(task_description: str) -> None`
+- **`mcp_config`** (str, optional) - Path to MCP (Model Context Protocol) server configuration file for external tool integration.
 **Example:**
 ```python
 # Basic initialization
 agent = A1(path='./biomni_data', llm='claude-sonnet-4-20250514')
 # With MCP integration
 agent = A1(
    path='./biomni_data',
    llm='claude-sonnet-4-20250514',
    mcp_config='./.biomni/mcp_config.json'
 )
 ```
 ### Core Methods
 #### `go(query: str) -> str`
 Execute a biomedical research task autonomously.
 ```python
-agent.go("Analyze this scRNA-seq dataset and identify cell types")
+result = agent.go(query: str)
 ```
 **Parameters:**
- `task_description` (str, required): Natural language description of the biomedical task to execute. Be specific about:
+- **`query`** (str) - Natural language description of the biomedical task to execute
-  - Data location and format
+
-  - Desired analysis or output
+**Returns:**
-  - Any specific methods or parameters
+- **`str`** - Final answer or analysis result from the agent
  - Expected results format
 **Behavior:**
-1. Decomposes the task into executable steps
+1. Decomposes query into executable sub-tasks
-2. Retrieves relevant biomedical knowledge from the data lake
+2. Retrieves relevant knowledge from integrated databases
-3. Generates and executes Python/R code
+3. Generates and executes Python code for analysis
-4. Provides results and visualizations
+4. Iterates on results until task completion
-5. Handles errors and retries with refinement
+5. Returns final synthesized answer
-**Notes:**
+**Example:**
- Executes code with system privileges - use in sandboxed environments
+```python
- Long-running tasks may require timeout adjustments
+result = agent.go("""
- Intermediate results are displayed during execution
+Identify genes associated with Alzheimer's disease from GWAS data.
 Perform pathway enrichment analysis on top hits.
 """)
 print(result)
 ```
-##### `save_conversation_history(output_path: str, format: str = 'pdf') -> None`
+#### `save_conversation_history(output_path: str, format: str = 'pdf')`
-Export conversation history and execution trace as a formatted report.
+Save complete conversation history including task, reasoning, code, and results.
 ```python
 agent.save_conversation_history(
-    output_path='./reports/analysis_log.pdf',
+    output_path: str,
-    format='pdf'
+    format: str = 'pdf'
 )
 ```
 **Parameters:**
- `output_path` (str, required): File path for the output report
+- **`output_path`** (str) - File path for saved report
- `format` (str, optional): Output format. Options: 'pdf', 'markdown'. Default: 'pdf'
+- **`format`** (str, optional, default='pdf') - Output format: `'pdf'`, `'html'`, or `'markdown'`
-**Requirements:**
+**Example:**
- For PDF: Install one of: WeasyPrint, markdown2pdf, or Pandoc
+```python
-  ```bash
+agent.save_conversation_history('reports/alzheimers_gwas_analysis.pdf')
-  pip install weasyprint  # Recommended
+```
  # or
  pip install markdown2pdf
  # or install Pandoc system-wide
  ```
-**Report Contents:**
+#### `reset()`
 - Task description and parameters
 - Retrieved biomedical knowledge
 - Generated code with execution traces
 - Results, visualizations, and outputs
 - Timestamps and execution metadata
-##### `add_mcp(config_path: str) -> None`
+Reset agent state and clear conversation history.
 Add Model Context Protocol (MCP) tools to extend agent capabilities.
 ```python
-agent.add_mcp(config_path='./mcp_tools_config.json')
+agent.reset()
 ```
-**Parameters:**
+Use when starting a new independent task to clear previous context.
 - `config_path` (str, required): Path to MCP configuration JSON file
-**MCP Configuration Format:**
+**Example:**
-```json
+```python
-{
+# Task 1
-  "tools": [
+agent.go("Analyze dataset A")
-    {
+agent.save_conversation_history("task1.pdf")
-      "name": "tool_name",
+
-      "endpoint": "http://localhost:8000/tool",
+# Reset for fresh context
-      "description": "Tool description for LLM",
+agent.reset()
-      "parameters": {
+
-        "param1": "string",
+# Task 2 - independent of Task 1
-        "param2": "integer"
+agent.go("Analyze dataset B")
      }
    }
  ]
 }
 ```
-**Use Cases:**
+### Configuration via default_config
 - Connect to laboratory information systems
 - Integrate proprietary databases
 - Access specialized computational resources
 - Link to institutional data repositories
-## Configuration
+Global configuration parameters accessible via `biomni.config.default_config`.
 ### default_config
 Global configuration object for Biomni settings.
 ```python
 from biomni.config import default_config
 ```
-#### Attributes
+# LLM Configuration
 ##### `llm: str`
 Default LLM model identifier for all agent instances.
 ```python
 default_config.llm = "claude-sonnet-4-20250514"
-```
+default_config.llm_temperature = 0.7
-**Supported Models:**
+# Execution Parameters
 **Anthropic:**
 - `claude-sonnet-4-20250514` (Recommended)
 - `claude-opus-4-20250514`
 - `claude-3-5-sonnet-20241022`
 - `claude-3-opus-20240229`
 **OpenAI:**
 - `gpt-4o`
 - `gpt-4`
 - `gpt-4-turbo`
 - `gpt-3.5-turbo`
 **Azure OpenAI:**
 - `azure/gpt-4`
 - `azure/<deployment-name>`
 **Google Gemini:**
 - `gemini/gemini-pro`
 - `gemini/gemini-1.5-pro`
 **Groq:**
 - `groq/llama-3.1-70b-versatile`
 - `groq/mixtral-8x7b-32768`
 **Ollama (Local):**
 - `ollama/llama3`
 - `ollama/mistral`
 - `ollama/<model-name>`
 **AWS Bedrock:**
 - `bedrock/anthropic.claude-v2`
 - `bedrock/anthropic.claude-3-sonnet`
 **Custom/Biomni-R0:**
 - `openai/biomni-r0` (requires local SGLang deployment)
 ##### `timeout_seconds: int`
 Default timeout for agent operations in seconds.
 ```python
 default_config.timeout_seconds = 1200  # 20 minutes
 default_config.max_iterations = 50     # Max reasoning loops
 default_config.max_tokens = 4096       # Max tokens per LLM call
 # Code Execution
 default_config.enable_code_execution = True
 default_config.sandbox_mode = False    # Enable for restricted execution
 # Data and Caching
 default_config.data_cache_dir = "./biomni_cache"
 default_config.enable_caching = True
 ```
-**Recommended Values:**
+**Key Parameters:**
 - Simple tasks (QC, basic analysis): 300-600 seconds
 - Medium tasks (differential expression, clustering): 600-1200 seconds
 - Complex tasks (full pipelines, ML models): 1200-3600 seconds
 - Very complex tasks: 3600+ seconds
-##### `data_path: str`
+- **`timeout_seconds`** (int, default=1200) - Maximum time for task execution. Increase for complex analyses.
-Default path to biomedical knowledge base.
+- **`max_iterations`** (int, default=50) - Maximum agent reasoning loops. Prevents infinite loops.
 - **`enable_code_execution`** (bool, default=True) - Allow agent to execute generated code. Disable for code generation only.
 - **`sandbox_mode`** (bool, default=False) - Enable sandboxed code execution (requires additional setup).
 ## BiomniEval1 Evaluation Framework
 Framework for benchmarking agent performance on biomedical tasks.
 ### Initialization
 ```python
-default_config.data_path = "/path/to/biomni/data"
+from biomni.eval import BiomniEval1
 evaluator = BiomniEval1(
    dataset_path: str = None,  # Path to evaluation dataset
    metrics: list = None        # Evaluation metrics to compute
 )
 ```
-**Storage Requirements:**
+**Example:**
 - Initial download: ~11GB
 - Extracted size: ~15GB
 - Additional working space: ~5-10GB recommended
 ##### `api_base: str`
 Custom API endpoint for LLM providers (advanced usage).
 ```python
-# For local Biomni-R0 deployment
+evaluator = BiomniEval1()
 default_config.api_base = "http://localhost:30000/v1"
 # For custom OpenAI-compatible endpoints
 default_config.api_base = "https://your-endpoint.com/v1"
 ```
-##### `max_retries: int`
+### Methods
-Number of retry attempts for failed operations.
+#### `evaluate(task_type: str, instance_id: str, answer: str) -> float`
 Evaluate agent answer against ground truth.
 ```python
-default_config.max_retries = 3
+score = evaluator.evaluate(
-```
+    task_type: str,     # Task category
-
+    instance_id: str,   # Specific task instance
-#### Methods
+    answer: str         # Agent-generated answer
 ##### `reset() -> None`
 Reset all configuration values to system defaults.
 ```python
 default_config.reset()
 ```
 ## Database Query System
 Biomni includes a retrieval-augmented generation (RAG) system for querying the biomedical knowledge base.
 ### Query Functions
 #### `query_genes(query: str, top_k: int = 10) -> List[Dict]`
 Query gene information from integrated databases.
 ```python
 from biomni.database import query_genes
 results = query_genes(
    query="genes involved in p53 pathway",
    top_k=20
 )
 ```
 **Parameters:**
- `query` (str): Natural language or gene identifier query
+- **`task_type`** (str) - Task category: `'crispr_design'`, `'scrna_analysis'`, `'gwas_interpretation'`, `'drug_admet'`, `'clinical_diagnosis'`
- `top_k` (int): Number of results to return
+- **`instance_id`** (str) - Unique identifier for task instance from dataset
 - **`answer`** (str) - Agent's answer to evaluate
-**Returns:** List of dictionaries containing:
+**Returns:**
- `gene_symbol`: Official gene symbol
+- **`float`** - Evaluation score (0.0 to 1.0)
 - `gene_name`: Full gene name
 - `description`: Functional description
 - `pathways`: Associated biological pathways
 - `go_terms`: Gene Ontology annotations
 - `diseases`: Associated diseases
 - `similarity_score`: Relevance score (0-1)
-#### `query_proteins(query: str, top_k: int = 10) -> List[Dict]`
+**Example:**
 ```python
 # Generate answer
 result = agent.go("Design CRISPR screen for autophagy genes")
-Query protein information from UniProt and other sources.
+# Evaluate
 score = evaluator.evaluate(
    task_type='crispr_design',
    instance_id='autophagy_001',
    answer=result
 )
 print(f"Score: {score:.2f}")
 ```
 #### `load_dataset() -> dict`
 Load the Biomni-Eval1 benchmark dataset.
 ```python
-from biomni.database import query_proteins
+dataset = evaluator.load_dataset()
 ```
-results = query_proteins(
+**Returns:**
-    query="kinase proteins in cell cycle",
+- **`dict`** - Dictionary with task instances organized by task type
-    top_k=15
+
 **Example:**
 ```python
 dataset = evaluator.load_dataset()
 for task_type, instances in dataset.items():
    print(f"{task_type}: {len(instances)} instances")
 ```
 #### `run_benchmark(agent: A1, task_types: list = None) -> dict`
 Run full benchmark evaluation on agent.
 ```python
 results = evaluator.run_benchmark(
    agent: A1,
    task_types: list = None  # Specific task types or None for all
 )
 ```
-**Returns:** List of dictionaries with protein metadata:
+**Returns:**
- `uniprot_id`: UniProt accession
+- **`dict`** - Results with scores, timing, and detailed metrics per task
 - `protein_name`: Protein name
 - `function`: Functional annotation
 - `domains`: Protein domains
 - `subcellular_location`: Cellular localization
 - `similarity_score`: Relevance score
 #### `query_drugs(query: str, top_k: int = 10) -> List[Dict]`
 Query drug and compound information.
 **Example:**
 ```python
-from biomni.database import query_drugs
+results = evaluator.run_benchmark(
-
+    agent=agent,
-results = query_drugs(
+    task_types=['crispr_design', 'scrna_analysis']
    query="FDA approved cancer drugs targeting EGFR",
    top_k=10
 )
 print(f"Overall accuracy: {results['mean_score']:.2f}")
 print(f"Average time: {results['mean_time']:.1f}s")
 ```
-**Returns:** Drug information including:
+## Data Lake API
 - `drug_name`: Common name
 - `drugbank_id`: DrugBank identifier
 - `indication`: Therapeutic indication
 - `mechanism`: Mechanism of action
 - `targets`: Molecular targets
 - `approval_status`: Regulatory status
 - `smiles`: Chemical structure (SMILES notation)
-#### `query_diseases(query: str, top_k: int = 10) -> List[Dict]`
+Access integrated biomedical databases programmatically.
-Query disease information from clinical databases.
+### Gene Database Queries
 ```python
-from biomni.database import query_diseases
+from biomni.data import GeneDB
-results = query_diseases(
+gene_db = GeneDB(path='./biomni_data')
-    query="autoimmune diseases affecting joints",
+
-    top_k=10
+# Query gene information
-)
+gene_info = gene_db.get_gene('BRCA1')
 # Returns: {'symbol': 'BRCA1', 'name': '...', 'function': '...', ...}
 # Search genes by pathway
 pathway_genes = gene_db.search_by_pathway('DNA repair')
 # Returns: List of gene symbols in pathway
 # Get gene interactions
 interactions = gene_db.get_interactions('TP53')
 # Returns: List of interacting genes with interaction types
 ```
-**Returns:** Disease data:
+### Protein Structure Access
 - `disease_name`: Standard disease name
 - `disease_id`: Ontology identifier
 - `symptoms`: Clinical manifestations
 - `associated_genes`: Genetic associations
 - `prevalence`: Epidemiological data
 #### `query_pathways(query: str, top_k: int = 10) -> List[Dict]`
 Query biological pathways from KEGG, Reactome, and other sources.
 ```python
-from biomni.database import query_pathways
+from biomni.data import ProteinDB
-results = query_pathways(
+protein_db = ProteinDB(path='./biomni_data')
-    query="immune response signaling pathways",
+
-    top_k=15
+# Get AlphaFold structure
-)
+structure = protein_db.get_structure('P38398')  # BRCA1 UniProt ID
 # Returns: Path to PDB file or structure object
 # Search PDB database
 pdb_entries = protein_db.search_pdb('kinase', resolution_max=2.5)
 # Returns: List of PDB IDs matching criteria
 ```
-**Returns:** Pathway information:
+### Clinical Data Access
 - `pathway_name`: Pathway name
 - `pathway_id`: Database identifier
 - `genes`: Genes in pathway
 - `description`: Functional description
 - `source`: Database source (KEGG, Reactome, etc.)
 ## Data Structures
 ### TaskResult
 Result object returned by complex agent operations.
 ```python
-class TaskResult:
+from biomni.data import ClinicalDB
-    success: bool           # Whether task completed successfully
+
-    output: Any            # Task output (varies by task)
+clinical_db = ClinicalDB(path='./biomni_data')
-    code: str             # Generated code
+
-    execution_time: float # Execution time in seconds
+# Query ClinVar variants
-    error: Optional[str]  # Error message if failed
+variant_info = clinical_db.get_variant('rs429358')  # APOE4 variant
-    metadata: Dict        # Additional metadata
+# Returns: {'significance': '...', 'disease': '...', 'frequency': ...}
 # Search OMIM for disease
 disease_info = clinical_db.search_omim('Alzheimer')
 # Returns: List of OMIM entries with gene associations
 ```
-### BiomedicalEntity
+### Literature Search
 Base class for biomedical entities in the knowledge base.
 ```python
-class BiomedicalEntity:
+from biomni.data import LiteratureDB
-    entity_id: str        # Unique identifier
+
-    entity_type: str      # Type (gene, protein, drug, etc.)
+lit_db = LiteratureDB(path='./biomni_data')
-    name: str            # Entity name
+
-    description: str     # Description
+# Search PubMed abstracts
-    attributes: Dict     # Additional attributes
+papers = lit_db.search('CRISPR screening cancer', max_results=10)
-    references: List[str] # Literature references
+# Returns: List of paper dictionaries with titles, abstracts, PMIDs
 # Get citations for paper
 citations = lit_db.get_citations('PMID:12345678')
 # Returns: List of citing papers
 ```
-## Utility Functions
+## MCP Server Integration
-### `download_data(path: str, force: bool = False) -> None`
+Extend biomni with external tools via Model Context Protocol.
-Manually download or update the biomedical knowledge base.
+### Configuration Format
 Create `.biomni/mcp_config.json`:
 ```json
 {
  "servers": {
    "fda-drugs": {
      "command": "python",
      "args": ["-m", "mcp_server_fda"],
      "env": {
        "FDA_API_KEY": "${FDA_API_KEY}"
      }
    },
    "web-search": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-brave-search"],
      "env": {
        "BRAVE_API_KEY": "${BRAVE_API_KEY}"
      }
    }
  }
 }
 ```
 ### Using MCP Tools in Tasks
 ```python
-from biomni.utils import download_data
+# Initialize with MCP config
-
+agent = A1(
 download_data(
    path='./data',
-    force=True  # Force re-download
+    llm='claude-sonnet-4-20250514',
    mcp_config='./.biomni/mcp_config.json'
 )
 ```
-### `validate_environment() -> Dict[str, bool]`
+# Agent can now use MCP tools automatically
-
+result = agent.go("""
-Check if the environment is properly configured.
+Search for FDA-approved drugs targeting EGFR.
-
+Get their approval dates and indications.
-```python
+""")
-from biomni.utils import validate_environment
+# Agent uses fda-drugs MCP server automatically
 status = validate_environment()
 # Returns: {
 #   'conda_env': True,
 #   'api_keys': True,
 #   'data_available': True,
 #   'dependencies': True
 # }
 ```
 ### `list_available_models() -> List[str]`
 Get a list of available LLM models based on configured API keys.
 ```python
 from biomni.utils import list_available_models
 models = list_available_models()
 # Returns: ['claude-sonnet-4-20250514', 'gpt-4o', ...]
 ```
 ## Error Handling
-### Common Exceptions
+Common exceptions and handling strategies:
 #### `BiomniConfigError`
 Raised when configuration is invalid or incomplete.
 ```python
-from biomni.exceptions import BiomniConfigError
+from biomni.exceptions import (
    BiomniException,
    LLMError,
    CodeExecutionError,
    DataNotFoundError,
    TimeoutError
 )
 try:
-    agent = A1(path='./data')
+    result = agent.go("Complex biomedical task")
-except BiomniConfigError as e:
+except TimeoutError:
-    print(f"Configuration error: {e}")
+    # Task exceeded timeout_seconds
-```
+    print("Task timed out. Consider increasing timeout.")
-
+    default_config.timeout_seconds = 3600
-#### `BiomniExecutionError`
+except CodeExecutionError as e:
-
+    # Generated code failed to execute
-Raised when code generation or execution fails.
+    print(f"Code execution error: {e}")
-
+    # Review generated code in conversation history
-```python
+except DataNotFoundError:
-from biomni.exceptions import BiomniExecutionError
+    # Required data not in data lake
-
+    print("Data not found. Ensure data lake is downloaded.")
-try:
+except LLMError as e:
-    agent.go("invalid task")
+    # LLM API error
-except BiomniExecutionError as e:
+    print(f"LLM error: {e}")
-    print(f"Execution failed: {e}")
+    # Check API keys and rate limits
    # Access failed code: e.code
    # Access error details: e.details
 ```
 #### `BiomniDataError`
 Raised when knowledge base or data access fails.
 ```python
 from biomni.exceptions import BiomniDataError
 try:
    results = query_genes("unknown query format")
 except BiomniDataError as e:
    print(f"Data access error: {e}")
 ```
 #### `BiomniTimeoutError`
 Raised when operations exceed timeout limit.
 ```python
 from biomni.exceptions import BiomniTimeoutError
 try:
    agent.go("very complex long-running task")
 except BiomniTimeoutError as e:
    print(f"Task timed out after {e.duration} seconds")
    # Partial results may be available: e.partial_results
 ```
 ## Best Practices
-### Efficient Knowledge Retrieval
+### Efficient API Usage
-Pre-query databases for relevant context before complex tasks:
+1. **Reuse agent instances** for related tasks to maintain context
 2. **Set appropriate timeouts** based on task complexity
 3. **Use caching** to avoid redundant data downloads
 4. **Monitor iterations** to detect reasoning loops early
 ### Production Deployment
 ```python
-from biomni.database import query_genes, query_pathways
+from biomni.agent import A1
 from biomni.config import default_config
 import logging
-# Gather relevant biological context first
+# Configure logging
-genes = query_genes("cell cycle genes", top_k=50)
+logging.basicConfig(level=logging.INFO)
 pathways = query_pathways("cell cycle regulation", top_k=20)
-# Then execute task with enriched context
+# Production settings
-agent.go(f"""
+default_config.timeout_seconds = 3600
-Analyze the cell cycle progression in this dataset.
+default_config.max_iterations = 100
-Focus on these genes: {[g['gene_symbol'] for g in genes]}
+default_config.sandbox_mode = True  # Enable sandboxing
 Consider these pathways: {[p['pathway_name'] for p in pathways]}
 """)
 ```
-### Error Recovery
+# Initialize with error handling
-
+try:
-Implement robust error handling for production workflows:
+    agent = A1(path='/data/biomni', llm='claude-sonnet-4-20250514')
-
+    result = agent.go(task_query)
-```python
+    agent.save_conversation_history(f'reports/{task_id}.pdf')
-from biomni.exceptions import BiomniExecutionError, BiomniTimeoutError
+except Exception as e:
-
+    logging.error(f"Task {task_id} failed: {e}")
-max_attempts = 3
+    # Handle failure appropriately
 for attempt in range(max_attempts):
    try:
        agent.go("complex biomedical task")
        break
    except BiomniTimeoutError:
        # Increase timeout and retry
        default_config.timeout_seconds *= 2
        print(f"Timeout, retrying with {default_config.timeout_seconds}s timeout")
    except BiomniExecutionError as e:
        # Refine task based on error
        print(f"Execution failed: {e}, refining task...")
        # Optionally modify task description
    else:
        print("Task failed after max attempts")
 ```
 ### Memory Management
-For large-scale analyses, manage memory explicitly:
+For large-scale analyses:
 ```python
 import gc
 # Process datasets in chunks
-for chunk_id in range(num_chunks):
+chunk_results = []
-    agent.go(f"Process data chunk {chunk_id} located at data/chunk_{chunk_id}.h5ad")
+for chunk in dataset_chunks:
    agent.reset()  # Clear memory between chunks
    result = agent.go(f"Analyze chunk: {chunk}")
    chunk_results.append(result)
-    # Force garbage collection between chunks
+# Combine results
-    gc.collect()
+final_result = combine_results(chunk_results)
    # Save intermediate results
    agent.save_conversation_history(f"./reports/chunk_{chunk_id}.pdf")
 ```
 ### Reproducibility
 Ensure reproducible analyses by:
 1. **Fixing random seeds:**
 ```python
 agent.go("Set random seed to 42 for all analyses, then perform clustering...")
 ```
 2. **Logging configuration:**
 ```python
 import json
 config_log = {
    'llm': default_config.llm,
    'timeout': default_config.timeout_seconds,
    'data_path': default_config.data_path,
    'timestamp': datetime.now().isoformat()
 }
 with open('config_log.json', 'w') as f:
    json.dump(config_log, f, indent=2)
 ```
 3. **Saving execution traces:**
 ```python
 # Always save detailed reports
 agent.save_conversation_history('./reports/full_analysis.pdf')
 ```
 ## Performance Optimization
 ### Model Selection Strategy
 Choose models based on task characteristics:
 ```python
 # For exploratory, simple tasks
 default_config.llm = "gpt-3.5-turbo"  # Fast, cost-effective
 # For standard biomedical analyses
 default_config.llm = "claude-sonnet-4-20250514"  # Recommended
 # For complex reasoning and hypothesis generation
 default_config.llm = "claude-opus-4-20250514"  # Highest quality
 # For specialized biological reasoning
 default_config.llm = "openai/biomni-r0"  # Requires local deployment
 ```
 ### Timeout Tuning
 Set appropriate timeouts based on task complexity:
 ```python
 # Quick queries and simple analyses
 agent = A1(path='./data', timeout=300)
 # Standard workflows
 agent = A1(path='./data', timeout=1200)
 # Full pipelines with ML training
 agent = A1(path='./data', timeout=3600)
 ```
 ### Caching and Reuse
 Reuse agent instances for multiple related tasks:
 ```python
 # Create agent once
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 # Execute multiple related tasks
 tasks = [
    "Load and QC the scRNA-seq dataset",
    "Perform clustering with resolution 0.5",
    "Identify marker genes for each cluster",
    "Annotate cell types based on markers"
 ]
 for task in tasks:
    agent.go(task)
 # Save complete workflow
 agent.save_conversation_history('./reports/full_workflow.pdf')
 ```
--- a/scientific-packages/biomni/references/llm_providers.md
+++ b/scientific-packages/biomni/references/llm_providers.md
--- a/scientific-packages/biomni/references/task_examples.md
+++ b/scientific-packages/biomni/references/task_examples.md
--- a/scientific-packages/biomni/references/use_cases.md
+++ b/scientific-packages/biomni/references/use_cases.md
@@ -0,0 +1,867 @@
 # Biomni Use Cases and Examples
 Comprehensive examples demonstrating biomni across biomedical research domains.
 ## Table of Contents
 1. [CRISPR Screening and Gene Editing](#crispr-screening-and-gene-editing)
 2. [Single-Cell RNA-seq Analysis](#single-cell-rna-seq-analysis)
 3. [Drug Discovery and ADMET](#drug-discovery-and-admet)
 4. [GWAS and Genetic Analysis](#gwas-and-genetic-analysis)
 5. [Clinical Genomics and Diagnostics](#clinical-genomics-and-diagnostics)
 6. [Protein Structure and Function](#protein-structure-and-function)
 7. [Literature and Knowledge Synthesis](#literature-and-knowledge-synthesis)
 8. [Multi-Omics Integration](#multi-omics-integration)
 ---
 ## CRISPR Screening and Gene Editing
 ### Example 1: Genome-Wide CRISPR Screen Design
 **Task:** Design a CRISPR knockout screen to identify genes regulating autophagy.
 ```python
 from biomni.agent import A1
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Design a genome-wide CRISPR knockout screen to identify genes regulating
 autophagy in HEK293 cells.
 Requirements:
 1. Generate comprehensive sgRNA library targeting all protein-coding genes
 2. Design 4 sgRNAs per gene with optimal on-target and minimal off-target scores
 3. Include positive controls (known autophagy regulators: ATG5, BECN1, ULK1)
 4. Include negative controls (non-targeting sgRNAs)
 5. Prioritize genes based on:
   - Existing autophagy pathway annotations
   - Protein-protein interactions with known autophagy factors
   - Expression levels in HEK293 cells
 6. Output sgRNA sequences, scores, and gene prioritization rankings
 Provide analysis as Python code and interpret results.
 """)
 agent.save_conversation_history("autophagy_screen_design.pdf")
 ```
 **Expected Output:**
 - sgRNA library with ~80,000 guides (4 per gene × ~20,000 genes)
 - On-target and off-target scores for each sgRNA
 - Prioritized gene list based on pathway enrichment
 - Quality control metrics for library design
 ### Example 2: CRISPR Off-Target Prediction
 ```python
 result = agent.go("""
 Analyze potential off-target effects for this sgRNA sequence:
 GCTGAAGATCCAGTTCGATG
 Tasks:
 1. Identify all genomic locations with ≤3 mismatches
 2. Score each potential off-target site
 3. Assess likelihood of cleavage at off-target sites
 4. Recommend whether sgRNA is suitable for use
 5. If unsuitable, suggest alternative sgRNAs for the same gene
 """)
 ```
 ### Example 3: Screen Hit Analysis
 ```python
 result = agent.go("""
 Analyze CRISPR screen results from autophagy phenotype screen.
 Input file: screen_results.csv
 Columns: sgRNA_ID, gene, log2_fold_change, p_value, FDR
 Tasks:
 1. Identify significant hits (FDR < 0.05, |LFC| > 1.5)
 2. Perform gene ontology enrichment on hit genes
 3. Map hits to known autophagy pathways
 4. Identify novel candidates not previously linked to autophagy
 5. Predict functional relationships between hit genes
 6. Generate visualization of hit genes in pathway context
 """)
 ```
 ---
 ## Single-Cell RNA-seq Analysis
 ### Example 1: Cell Type Annotation
 **Task:** Analyze single-cell RNA-seq data and annotate cell populations.
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Analyze single-cell RNA-seq dataset from human PBMC sample.
 File: pbmc_data.h5ad (10X Genomics format)
 Workflow:
 1. Quality control:
   - Filter cells with <200 or >5000 detected genes
   - Remove cells with >20% mitochondrial content
   - Filter genes detected in <3 cells
 2. Normalization and preprocessing:
   - Normalize to 10,000 reads per cell
   - Log-transform
   - Identify highly variable genes
   - Scale data
 3. Dimensionality reduction:
   - PCA (50 components)
   - UMAP visualization
 4. Clustering:
   - Leiden algorithm with resolution=0.8
   - Identify cluster markers (Wilcoxon rank-sum test)
 5. Cell type annotation:
   - Annotate clusters using marker genes:
     * T cells (CD3D, CD3E)
     * B cells (CD79A, MS4A1)
     * NK cells (GNLY, NKG7)
     * Monocytes (CD14, LYZ)
     * Dendritic cells (FCER1A, CST3)
 6. Generate UMAP plots with annotations and export results
 """)
 agent.save_conversation_history("pbmc_scrna_analysis.pdf")
 ```
 ### Example 2: Differential Expression Analysis
 ```python
 result = agent.go("""
 Perform differential expression analysis between conditions in scRNA-seq data.
 Data: pbmc_treated_vs_control.h5ad
 Conditions: treated (drug X) vs control
 Tasks:
 1. Identify differentially expressed genes for each cell type
 2. Use statistical tests appropriate for scRNA-seq (MAST or Wilcoxon)
 3. Apply multiple testing correction (Benjamini-Hochberg)
 4. Threshold: |log2FC| > 0.5, adjusted p < 0.05
 5. Perform pathway enrichment on DE genes per cell type
 6. Identify cell-type-specific drug responses
 7. Generate volcano plots and heatmaps
 """)
 ```
 ### Example 3: Trajectory Analysis
 ```python
 result = agent.go("""
 Perform pseudotime trajectory analysis on differentiation dataset.
 Data: hematopoiesis_scrna.h5ad
 Starting population: Hematopoietic stem cells (HSCs)
 Analysis:
 1. Subset to hematopoietic lineages
 2. Compute diffusion map or PAGA for trajectory inference
 3. Order cells along pseudotime
 4. Identify genes with dynamic expression along trajectory
 5. Cluster genes by expression patterns
 6. Map trajectories to known differentiation pathways
 7. Visualize key transcription factors driving differentiation
 """)
 ```
 ---
 ## Drug Discovery and ADMET
 ### Example 1: ADMET Property Prediction
 **Task:** Predict ADMET properties for drug candidates.
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Predict ADMET properties for these drug candidates:
 Compounds (SMILES format):
 1. CC1=C(C=C(C=C1)NC(=O)C2=CC=C(C=C2)CN3CCN(CC3)C)NC4=NC=CC(=N4)C5=CN=CC=C5
 2. CN1CCN(CC1)C2=C(C=C3C(=C2)N=CN=C3NC4=CC=C(C=C4)F)OC
 3. CC(C)(C)NC(=O)N(CC1=CC=CC=C1)C2CCN(CC2)C(=O)C3=CC4=C(C=C3)OCO4
 For each compound, predict:
 **Absorption:**
 - Caco-2 permeability (cm/s)
 - Human intestinal absorption (HIA %)
 - Oral bioavailability
 **Distribution:**
 - Plasma protein binding (%)
 - Blood-brain barrier penetration (BBB+/-)
 - Volume of distribution (L/kg)
 **Metabolism:**
 - CYP450 substrate/inhibitor predictions (2D6, 3A4, 2C9, 2C19)
 - Metabolic stability (T1/2)
 **Excretion:**
 - Clearance (mL/min/kg)
 - Half-life (hours)
 **Toxicity:**
 - hERG IC50 (cardiotoxicity risk)
 - Hepatotoxicity prediction
 - Ames mutagenicity
 - LD50 estimates
 Provide predictions with confidence scores and flag any red flags.
 """)
 agent.save_conversation_history("admet_predictions.pdf")
 ```
 ### Example 2: Target Identification
 ```python
 result = agent.go("""
 Identify potential protein targets for Alzheimer's disease drug development.
 Tasks:
 1. Query GWAS data for Alzheimer's-associated genes
 2. Identify genes with druggable domains (kinases, GPCRs, ion channels, etc.)
 3. Check for brain expression patterns
 4. Assess disease relevance via literature mining
 5. Evaluate existing chemical probe availability
 6. Rank targets by:
   - Genetic evidence strength
   - Druggability
   - Lack of existing therapies
 7. Suggest target validation experiments
 """)
 ```
 ### Example 3: Virtual Screening
 ```python
 result = agent.go("""
 Perform virtual screening for EGFR kinase inhibitors.
 Database: ZINC15 lead-like subset (~6M compounds)
 Target: EGFR kinase domain (PDB: 1M17)
 Workflow:
 1. Prepare protein structure (remove waters, add hydrogens)
 2. Define binding pocket (based on erlotinib binding site)
 3. Generate pharmacophore model from known EGFR inhibitors
 4. Filter ZINC database by:
   - Molecular weight: 200-500 Da
   - LogP: 0-5
   - Lipinski's rule of five
   - Pharmacophore match
 5. Dock top 10,000 compounds
 6. Score by docking energy and predicted binding affinity
 7. Select top 100 for further analysis
 8. Predict ADMET properties for top hits
 9. Recommend top 10 compounds for experimental validation
 """)
 ```
 ---
 ## GWAS and Genetic Analysis
 ### Example 1: GWAS Summary Statistics Analysis
 **Task:** Interpret GWAS results and identify causal genes.
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Analyze GWAS summary statistics for Type 2 Diabetes.
 Input file: t2d_gwas_summary.txt
 Columns: CHR, BP, SNP, P, OR, BETA, SE, A1, A2
 Analysis steps:
 1. Identify genome-wide significant variants (P < 5e-8)
 2. Perform LD clumping to identify independent signals
 3. Map variants to genes using:
   - Nearest gene
   - eQTL databases (GTEx)
   - Hi-C chromatin interactions
 4. Prioritize causal genes using multiple evidence:
   - Fine-mapping scores
   - Coding variant consequences
   - Gene expression in relevant tissues (pancreas, liver, adipose)
   - Pathway enrichment
 5. Identify druggable targets among causal genes
 6. Compare with known T2D genes and highlight novel associations
 7. Generate Manhattan plot, QQ plot, and gene prioritization table
 """)
 agent.save_conversation_history("t2d_gwas_analysis.pdf")
 ```
 ### Example 2: Polygenic Risk Score
 ```python
 result = agent.go("""
 Develop and validate polygenic risk score (PRS) for coronary artery disease (CAD).
 Training GWAS: CAD_discovery_summary_stats.txt (N=180,000)
 Validation cohort: CAD_validation_genotypes.vcf (N=50,000)
 Tasks:
 1. Select variants for PRS using p-value thresholding (P < 1e-5)
 2. Perform LD clumping (r² < 0.1, 500kb window)
 3. Calculate PRS weights from GWAS betas
 4. Compute PRS for validation cohort individuals
 5. Evaluate PRS performance:
   - AUC for CAD case/control discrimination
   - Odds ratios across PRS deciles
   - Compare to traditional risk factors (age, sex, BMI, smoking)
 6. Assess PRS calibration and create risk stratification plot
 7. Identify high-risk individuals (top 5% PRS)
 """)
 ```
 ### Example 3: Variant Pathogenicity Prediction
 ```python
 result = agent.go("""
 Predict pathogenicity of rare coding variants in candidate disease genes.
 Variants (VCF format):
 - chr17:41234451:A>G (BRCA1 p.Arg1347Gly)
 - chr2:179428448:C>T (TTN p.Trp13579*)
 - chr7:117188679:G>A (CFTR p.Gly542Ser)
 For each variant, assess:
 1. In silico predictions (SIFT, PolyPhen2, CADD, REVEL)
 2. Population frequency (gnomAD)
 3. Evolutionary conservation (PhyloP, PhastCons)
 4. Protein structure impact (using AlphaFold structures)
 5. Functional domain location
 6. ClinVar annotations (if available)
 7. Literature evidence
 8. ACMG/AMP classification criteria
 Provide pathogenicity classification (benign, likely benign, VUS, likely pathogenic, pathogenic) with supporting evidence.
 """)
 ```
 ---
 ## Clinical Genomics and Diagnostics
 ### Example 1: Rare Disease Diagnosis
 **Task:** Diagnose rare genetic disease from whole exome sequencing.
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Analyze whole exome sequencing (WES) data for rare disease diagnosis.
 Patient phenotypes (HPO terms):
 - HP:0001250 (Seizures)
 - HP:0001249 (Intellectual disability)
 - HP:0001263 (Global developmental delay)
 - HP:0001252 (Hypotonia)
 VCF file: patient_trio.vcf (proband + parents)
 Analysis workflow:
 1. Variant filtering:
   - Quality filters (QUAL > 30, DP > 10, GQ > 20)
   - Frequency filters (gnomAD AF < 0.01)
   - Functional impact (missense, nonsense, frameshift, splice site)
 2. Inheritance pattern analysis:
   - De novo variants
   - Autosomal recessive (compound het, homozygous)
   - X-linked
 3. Phenotype-driven prioritization:
   - Match candidate genes to HPO terms
   - Use HPO-gene associations
   - Check gene expression in relevant tissues (brain)
 4. Variant pathogenicity assessment:
   - In silico predictions
   - ACMG classification
   - Literature evidence
 5. Generate diagnostic report with:
   - Top candidate variants
   - Supporting evidence
   - Functional validation suggestions
   - Genetic counseling recommendations
 """)
 agent.save_conversation_history("rare_disease_diagnosis.pdf")
 ```
 ### Example 2: Cancer Genomics Analysis
 ```python
 result = agent.go("""
 Analyze tumor-normal paired sequencing for cancer genomics.
 Files:
 - tumor_sample.vcf (somatic variants)
 - tumor_rnaseq.bam (gene expression)
 - tumor_cnv.seg (copy number variants)
 Analysis:
 1. Identify driver mutations:
   - Known cancer genes (COSMIC, OncoKB)
   - Recurrent hotspot mutations
   - Truncating mutations in tumor suppressors
 2. Analyze mutational signatures:
   - Decompose signatures (COSMIC signatures)
   - Identify mutagenic processes
 3. Copy number analysis:
   - Identify amplifications and deletions
   - Focal vs. arm-level events
   - Assess oncogene amplifications and TSG deletions
 4. Gene expression analysis:
   - Identify outlier gene expression
   - Fusion transcript detection
   - Pathway dysregulation
 5. Therapeutic implications:
   - Match alterations to FDA-approved therapies
   - Identify clinical trial opportunities
   - Predict response to targeted therapies
 6. Generate precision oncology report
 """)
 ```
 ### Example 3: Pharmacogenomics
 ```python
 result = agent.go("""
 Generate pharmacogenomics report for patient genotype data.
 VCF file: patient_pgx.vcf
 Analyze variants affecting drug metabolism:
 **CYP450 genes:**
 - CYP2D6 (affects ~25% of drugs)
 - CYP2C19 (clopidogrel, PPIs, antidepressants)
 - CYP2C9 (warfarin, NSAIDs)
 - CYP3A5 (tacrolimus, immunosuppressants)
 **Drug transporter genes:**
 - SLCO1B1 (statin myopathy risk)
 - ABCB1 (P-glycoprotein)
 **Drug targets:**
 - VKORC1 (warfarin dosing)
 - DPYD (fluoropyrimidine toxicity)
 - TPMT (thiopurine toxicity)
 For each gene:
 1. Determine diplotype (*1/*1, *1/*2, etc.)
 2. Assign metabolizer phenotype (PM, IM, NM, RM, UM)
 3. Provide dosing recommendations using CPIC/PharmGKB guidelines
 4. Flag high-risk drug-gene interactions
 5. Suggest alternative medications if needed
 Generate patient-friendly report with actionable recommendations.
 """)
 ```
 ---
 ## Protein Structure and Function
 ### Example 1: AlphaFold Structure Analysis
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Analyze AlphaFold structure prediction for novel protein.
 Protein: Hypothetical protein ABC123 (UniProt: Q9XYZ1)
 Tasks:
 1. Retrieve AlphaFold structure from database
 2. Assess prediction quality:
   - pLDDT scores per residue
   - Identify high-confidence regions (pLDDT > 90)
   - Flag low-confidence regions (pLDDT < 50)
 3. Structural analysis:
   - Identify domains using structural alignment
   - Predict fold family
   - Identify secondary structure elements
 4. Functional prediction:
   - Search for structural homologs in PDB
   - Identify conserved functional sites
   - Predict binding pockets
   - Suggest possible ligands/substrates
 5. Variant impact analysis:
   - Map disease-associated variants to structure
   - Predict structural consequences
   - Identify variants affecting binding sites
 6. Generate PyMOL visualization scripts highlighting key features
 """)
 agent.save_conversation_history("alphafold_analysis.pdf")
 ```
 ### Example 2: Protein-Protein Interaction Prediction
 ```python
 result = agent.go("""
 Predict and analyze protein-protein interactions for autophagy pathway.
 Query proteins: ATG5, ATG12, ATG16L1
 Analysis:
 1. Retrieve known interactions from:
   - STRING database
   - BioGRID
   - IntAct
   - Literature mining
 2. Predict novel interactions using:
   - Structural modeling (AlphaFold-Multimer)
   - Coexpression analysis
   - Phylogenetic profiling
 3. Analyze interaction interfaces:
   - Identify binding residues
   - Assess interface properties (area, hydrophobicity)
   - Predict binding affinity
 4. Functional analysis:
   - Map interactions to autophagy pathway steps
   - Identify regulatory interactions
   - Predict complex stoichiometry
 5. Therapeutic implications:
   - Identify druggable interfaces
   - Suggest peptide inhibitors
   - Design disruption strategies
 Generate network visualization and interaction details.
 """)
 ```
 ---
 ## Literature and Knowledge Synthesis
 ### Example 1: Systematic Literature Review
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Perform systematic literature review on CRISPR base editing applications.
 Search query: "CRISPR base editing" OR "base editor" OR "CBE" OR "ABE"
 Date range: 2016-2025
 Tasks:
 1. Search PubMed and retrieve relevant abstracts
 2. Filter for original research articles
 3. Extract key information:
   - Base editor type (CBE, ABE, dual)
   - Target organism/cell type
   - Application (disease model, therapy, crop improvement)
   - Editing efficiency
   - Off-target assessment
 4. Categorize applications:
   - Therapeutic applications (by disease)
   - Agricultural applications
   - Basic research
 5. Analyze trends:
   - Publications over time
   - Most studied diseases
   - Evolution of base editor technology
 6. Synthesize findings:
   - Clinical trial status
   - Remaining challenges
   - Future directions
 Generate comprehensive review document with citation statistics.
 """)
 agent.save_conversation_history("crispr_base_editing_review.pdf")
 ```
 ### Example 2: Gene Function Synthesis
 ```python
 result = agent.go("""
 Synthesize knowledge about gene function from multiple sources.
 Target gene: PARK7 (DJ-1)
 Integrate information from:
 1. **Genetic databases:**
   - NCBI Gene
   - UniProt
   - OMIM
 2. **Expression data:**
   - GTEx tissue expression
   - Human Protein Atlas
   - Single-cell expression atlases
 3. **Functional data:**
   - GO annotations
   - KEGG pathways
   - Reactome
   - Protein interactions (STRING)
 4. **Disease associations:**
   - ClinVar variants
   - GWAS catalog
   - Disease databases (DisGeNET)
 5. **Literature:**
   - PubMed abstracts
   - Key mechanistic studies
   - Review articles
 Synthesize into comprehensive gene report:
 - Molecular function
 - Biological processes
 - Cellular localization
 - Tissue distribution
 - Disease associations
 - Known drug targets/inhibitors
 - Unresolved questions
 Generate structured summary suitable for research planning.
 """)
 ```
 ---
 ## Multi-Omics Integration
 ### Example 1: Multi-Omics Disease Analysis
 ```python
 agent = A1(path='./data', llm='claude-sonnet-4-20250514')
 result = agent.go("""
 Integrate multi-omics data to understand disease mechanism.
 Disease: Alzheimer's disease
 Data types:
 - Genomics: GWAS summary statistics (gwas_ad.txt)
 - Transcriptomics: Brain RNA-seq (controls vs AD, rnaseq_data.csv)
 - Proteomics: CSF proteomics (proteomics_csf.csv)
 - Metabolomics: Plasma metabolomics (metabolomics_plasma.csv)
 - Epigenomics: Brain methylation array (methylation_data.csv)
 Integration workflow:
 1. Analyze each omics layer independently:
   - Identify significantly altered features
   - Perform pathway enrichment
 2. Cross-omics correlation:
   - Correlate gene expression with protein levels
   - Link genetic variants to expression (eQTL)
   - Associate methylation with gene expression
   - Connect proteins to metabolites
 3. Network analysis:
   - Build multi-omics network
   - Identify key hub genes/proteins
   - Detect disease modules
 4. Causal inference:
   - Prioritize drivers vs. consequences
   - Identify therapeutic targets
   - Predict drug mechanisms
 5. Generate integrative model of AD pathogenesis
 Provide visualization and therapeutic target recommendations.
 """)
 agent.save_conversation_history("ad_multiomics_analysis.pdf")
 ```
 ### Example 2: Systems Biology Modeling
 ```python
 result = agent.go("""
 Build systems biology model of metabolic pathway.
 Pathway: Glycolysis
 Data sources:
 - Enzyme kinetics (BRENDA database)
 - Metabolite concentrations (literature)
 - Gene expression (tissue-specific, GTEx)
 - Flux measurements (C13 labeling studies)
 Modeling tasks:
 1. Construct pathway model:
   - Define reactions and stoichiometry
   - Parameterize enzyme kinetics (Km, Vmax, Ki)
   - Set initial metabolite concentrations
 2. Simulate pathway dynamics:
   - Steady-state analysis
   - Time-course simulations
   - Sensitivity analysis
 3. Constraint-based modeling:
   - Flux balance analysis (FBA)
   - Identify bottleneck reactions
   - Predict metabolic engineering strategies
 4. Integrate with gene expression:
   - Tissue-specific model predictions
   - Disease vs. normal comparisons
 5. Therapeutic predictions:
   - Enzyme inhibition effects
   - Metabolic rescue strategies
   - Drug target identification
 Generate model in SBML format and simulation results.
 """)
 ```
 ---
 ## Best Practices for Task Formulation
 ### 1. Be Specific and Detailed
 **Poor:**
 ```python
 agent.go("Analyze this RNA-seq data")
 ```
 **Good:**
 ```python
 agent.go("""
 Analyze bulk RNA-seq data from cancer vs. normal samples.
 Files: cancer_rnaseq.csv (TPM values, 50 cancer, 50 normal)
 Tasks:
 1. Differential expression (DESeq2, padj < 0.05, |log2FC| > 1)
 2. Pathway enrichment (KEGG, Reactome)
 3. Generate volcano plot and top DE gene heatmap
 """)
 ```
 ### 2. Include File Paths and Formats
 Always specify:
 - Exact file paths
 - File formats (VCF, BAM, CSV, H5AD, etc.)
 - Data structure (columns, sample IDs)
 ### 3. Set Clear Success Criteria
 Define thresholds and cutoffs:
 - Statistical significance (P < 0.05, FDR < 0.1)
 - Fold change thresholds
 - Quality filters
 - Expected outputs
 ### 4. Request Visualizations
 Explicitly ask for plots:
 - Volcano plots, MA plots
 - Heatmaps, PCA plots
 - Network diagrams
 - Manhattan plots
 ### 5. Specify Biological Context
 Include:
 - Organism (human, mouse, etc.)
 - Tissue/cell type
 - Disease/condition
 - Treatment details
 ### 6. Request Interpretations
 Ask agent to:
 - Interpret biological significance
 - Suggest follow-up experiments
 - Identify limitations
 - Provide literature context
 ---
 ## Common Patterns
 ### Data Quality Control
 ```python
 """
 Before analysis, perform quality control:
 1. Check for missing values
 2. Assess data distributions
 3. Identify outliers
 4. Generate QC report
 Only proceed with analysis if data passes QC.
 """
 ```
 ### Iterative Refinement
 ```python
 """
 Perform analysis in stages:
 1. Initial exploratory analysis
 2. Based on results, refine parameters
 3. Focus on interesting findings
 4. Generate final report
 Show intermediate results for each stage.
 """
 ```
 ### Reproducibility
 ```python
 """
 Ensure reproducibility:
 1. Set random seeds where applicable
 2. Log all parameters used
 3. Save intermediate files
 4. Export environment info (package versions)
 5. Generate methods section for paper
 """
 ```
 These examples demonstrate the breadth of biomedical tasks biomni can handle. Adapt the patterns to your specific research questions, and always include sufficient detail for the agent to execute autonomously.
--- a/scientific-packages/biomni/scripts/generate_report.py
+++ b/scientific-packages/biomni/scripts/generate_report.py
@@ -1,381 +1,370 @@
 #!/usr/bin/env python3
 """
-Enhanced PDF Report Generation for Biomni
+Enhanced PDF report generation for biomni conversation histories.
-This script provides advanced PDF report generation with custom formatting,
+This script provides additional customization options for biomni reports:
-styling, and metadata for Biomni analysis results.
+- Custom styling and branding
 - Formatted code blocks
 - Section organization
 - Metadata inclusion
 - Export format options (PDF, HTML, Markdown)
 Usage:
    python generate_report.py --input conversation.json --output report.pdf
    python generate_report.py --agent-object agent --output report.pdf --format html
 """
 import argparse
-import sys
+import json
 from pathlib import Path
 from typing import Dict, List, Optional, Any
 from datetime import datetime
 from typing import Optional, Dict, Any
-def generate_markdown_report(
+def format_conversation_history(
-    title: str,
+    messages: List[Dict[str, Any]],
-    sections: list,
+    include_metadata: bool = True,
-    metadata: Optional[Dict[str, Any]] = None,
+    include_code: bool = True,
-    output_path: str = "report.md"
+    include_timestamps: bool = False
 ) -> str:
    """
-    Generate a formatted markdown report.
+    Format conversation history into structured markdown.
    Args:
-        title: Report title
+        messages: List of conversation message dictionaries
-        sections: List of dicts with 'heading' and 'content' keys
+        include_metadata: Include metadata section
-        metadata: Optional metadata dict (author, date, etc.)
+        include_code: Include code blocks
-        output_path: Path to save markdown file
+        include_timestamps: Include message timestamps
    Returns:
-        Path to generated markdown file
+        Formatted markdown string
    """
    md_content = []
    # Title
    md_content.append(f"# {title}\n")
    # Metadata
    if metadata:
        md_content.append("---\n")
        for key, value in metadata.items():
            md_content.append(f"**{key}:** {value}  \n")
        md_content.append("---\n\n")
    # Sections
    for section in sections:
        heading = section.get('heading', 'Section')
        content = section.get('content', '')
        level = section.get('level', 2)  # Default to h2
        md_content.append(f"{'#' * level} {heading}\n\n")
        md_content.append(f"{content}\n\n")
    # Write to file
    output = Path(output_path)
    output.write_text('\n'.join(md_content))
    return str(output)
 def convert_to_pdf_weasyprint(
    markdown_path: str,
    output_path: str,
    css_style: Optional[str] = None
 ) -> bool:
    """
    Convert markdown to PDF using WeasyPrint.
    Args:
        markdown_path: Path to markdown file
        output_path: Path for output PDF
        css_style: Optional CSS stylesheet path
    Returns:
        True if successful, False otherwise
    """
    try:
        import markdown
        from weasyprint import HTML, CSS
        # Read markdown
        with open(markdown_path, 'r') as f:
            md_content = f.read()
        # Convert to HTML
        html_content = markdown.markdown(
            md_content,
            extensions=['tables', 'fenced_code', 'codehilite']
        )
        # Wrap in HTML template
        html_template = f"""
        <!DOCTYPE html>
        <html>
        <head>
            <meta charset="utf-8">
            <title>Biomni Report</title>
            <style>
                body {{
                    font-family: 'Helvetica', 'Arial', sans-serif;
                    line-height: 1.6;
                    color: #333;
                    max-width: 800px;
                    margin: 40px auto;
                    padding: 20px;
                }}
                h1 {{
                    color: #2c3e50;
                    border-bottom: 3px solid #3498db;
                    padding-bottom: 10px;
                }}
                h2 {{
                    color: #34495e;
                    margin-top: 30px;
                    border-bottom: 1px solid #bdc3c7;
                    padding-bottom: 5px;
                }}
                h3 {{
                    color: #7f8c8d;
                }}
                code {{
                    background-color: #f4f4f4;
                    padding: 2px 6px;
                    border-radius: 3px;
                    font-family: 'Courier New', monospace;
                }}
                pre {{
                    background-color: #f4f4f4;
                    padding: 15px;
                    border-radius: 5px;
                    overflow-x: auto;
                }}
                table {{
                    border-collapse: collapse;
                    width: 100%;
                    margin: 20px 0;
                }}
                th, td {{
                    border: 1px solid #ddd;
                    padding: 12px;
                    text-align: left;
                }}
                th {{
                    background-color: #3498db;
                    color: white;
                }}
                tr:nth-child(even) {{
                    background-color: #f9f9f9;
                }}
                .metadata {{
                    background-color: #ecf0f1;
                    padding: 15px;
                    border-radius: 5px;
                    margin: 20px 0;
                }}
            </style>
        </head>
        <body>
            {html_content}
        </body>
        </html>
        """
        # Generate PDF
        pdf = HTML(string=html_template)
        # Add custom CSS if provided
        stylesheets = []
        if css_style and Path(css_style).exists():
            stylesheets.append(CSS(filename=css_style))
        pdf.write_pdf(output_path, stylesheets=stylesheets)
        return True
    except ImportError:
        print("Error: WeasyPrint not installed. Install with: pip install weasyprint")
        return False
    except Exception as e:
        print(f"Error generating PDF: {e}")
        return False
 def convert_to_pdf_pandoc(markdown_path: str, output_path: str) -> bool:
    """
    Convert markdown to PDF using Pandoc.
    Args:
        markdown_path: Path to markdown file
        output_path: Path for output PDF
    Returns:
        True if successful, False otherwise
    """
    try:
        import subprocess
        # Check if pandoc is installed
        result = subprocess.run(
            ['pandoc', '--version'],
            capture_output=True,
            text=True
        )
        if result.returncode != 0:
            print("Error: Pandoc not installed")
            return False
        # Convert with pandoc
        result = subprocess.run(
            [
                'pandoc',
                markdown_path,
                '-o', output_path,
                '--pdf-engine=pdflatex',
                '-V', 'geometry:margin=1in',
                '--toc'
            ],
            capture_output=True,
            text=True
        )
        if result.returncode != 0:
            print(f"Pandoc error: {result.stderr}")
            return False
        return True
    except FileNotFoundError:
        print("Error: Pandoc not found. Install from https://pandoc.org/")
        return False
    except Exception as e:
        print(f"Error: {e}")
        return False
 def create_biomni_report(
    conversation_history: list,
    output_path: str = "biomni_report.pdf",
    method: str = "weasyprint"
 ) -> bool:
    """
    Create a formatted PDF report from Biomni conversation history.
    Args:
        conversation_history: List of conversation turns
        output_path: Output PDF path
        method: Conversion method ('weasyprint' or 'pandoc')
    Returns:
        True if successful
    """
    # Prepare report sections
    metadata = {
        'Date': datetime.now().strftime('%Y-%m-%d %H:%M:%S'),
        'Tool': 'Biomni AI Agent',
        'Report Type': 'Analysis Summary'
    }
    sections = []
-    # Executive Summary
+    # Header
-    sections.append({
+    sections.append("# Biomni Analysis Report\n")
        'heading': 'Executive Summary',
        'level': 2,
        'content': 'This report contains the complete analysis workflow executed by the Biomni biomedical AI agent.'
    })
-    # Conversation history
+    # Metadata
-    for i, turn in enumerate(conversation_history, 1):
+    if include_metadata:
-        sections.append({
+        sections.append("## Metadata\n")
-            'heading': f'Task {i}: {turn.get("task", "Analysis")}',
+        sections.append(f"- **Generated**: {datetime.now().strftime('%Y-%m-%d %H:%M:%S')}")
-            'level': 2,
+        sections.append(f"- **Number of interactions**: {len(messages)}")
-            'content': f'**Input:**\n```\n{turn.get("input", "")}\n```\n\n**Output:**\n{turn.get("output", "")}'
+        sections.append("\n---\n")
-        })
+
    # Process messages
    sections.append("## Analysis\n")
    for i, msg in enumerate(messages, 1):
        role = msg.get('role', 'unknown')
        content = msg.get('content', '')
        if role == 'user':
            sections.append(f"### Task {i // 2 + 1}\n")
            sections.append(f"**Query:**\n```\n{content}\n```\n")
        elif role == 'assistant':
            sections.append(f"**Response:**\n")
            # Check if content contains code
            if include_code and ('```' in content or 'import ' in content):
                # Attempt to separate text and code
                parts = content.split('```')
                for j, part in enumerate(parts):
                    if j % 2 == 0:
                        # Text content
                        if part.strip():
                            sections.append(f"{part.strip()}\n")
                    else:
                        # Code content
                        # Check if language is specified
                        lines = part.split('\n', 1)
                        if len(lines) > 1 and lines[0].strip() in ['python', 'r', 'bash', 'sql']:
                            lang = lines[0].strip()
                            code = lines[1]
                        else:
                            lang = 'python'  # Default to python
                            code = part
                        sections.append(f"```{lang}\n{code}\n```\n")
            else:
                sections.append(f"{content}\n")
            sections.append("\n---\n")
    return '\n'.join(sections)
 def markdown_to_html(markdown_content: str, title: str = "Biomni Report") -> str:
    """
    Convert markdown to styled HTML.
    Args:
        markdown_content: Markdown string
        title: HTML page title
    Returns:
        HTML string
    """
    # Simple markdown to HTML conversion
    # For production use, consider using a library like markdown or mistune
    html_template = f"""
 <!DOCTYPE html>
 <html lang="en">
 <head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>{title}</title>
    <style>
        body {{
            font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, sans-serif;
            line-height: 1.6;
            max-width: 900px;
            margin: 0 auto;
            padding: 20px;
            color: #333;
        }}
        h1 {{
            color: #2c3e50;
            border-bottom: 3px solid #3498db;
            padding-bottom: 10px;
        }}
        h2 {{
            color: #34495e;
            margin-top: 30px;
            border-bottom: 2px solid #95a5a6;
            padding-bottom: 5px;
        }}
        h3 {{
            color: #555;
        }}
        code {{
            background-color: #f4f4f4;
            padding: 2px 6px;
            border-radius: 3px;
            font-family: 'Monaco', 'Menlo', 'Courier New', monospace;
        }}
        pre {{
            background-color: #f8f8f8;
            border: 1px solid #ddd;
            border-radius: 5px;
            padding: 15px;
            overflow-x: auto;
        }}
        pre code {{
            background-color: transparent;
            padding: 0;
        }}
        hr {{
            border: none;
            border-top: 1px solid #ddd;
            margin: 30px 0;
        }}
        .metadata {{
            background-color: #ecf0f1;
            padding: 15px;
            border-radius: 5px;
            margin-bottom: 20px;
        }}
        .task {{
            background-color: #e8f4f8;
            padding: 10px;
            border-left: 4px solid #3498db;
            margin: 20px 0;
        }}
        .footer {{
            margin-top: 50px;
            text-align: center;
            color: #7f8c8d;
            font-size: 0.9em;
        }}
    </style>
 </head>
 <body>
    <div class="content">
        {markdown_to_html_simple(markdown_content)}
    </div>
    <div class="footer">
        <p>Generated with Biomni | Stanford SNAP Lab</p>
        <p><a href="https://github.com/snap-stanford/biomni">github.com/snap-stanford/biomni</a></p>
    </div>
 </body>
 </html>
 """
    return html_template
 def markdown_to_html_simple(md: str) -> str:
    """Simple markdown to HTML converter (basic implementation)."""
    lines = md.split('\n')
    html_lines = []
    in_code_block = False
    in_list = False
    for line in lines:
        # Code blocks
        if line.startswith('```'):
            if in_code_block:
                html_lines.append('</code></pre>')
                in_code_block = False
            else:
                lang = line[3:].strip()
                html_lines.append(f'<pre><code class="language-{lang}">')
                in_code_block = True
            continue
        if in_code_block:
            html_lines.append(line)
            continue
        # Headers
        if line.startswith('# '):
            html_lines.append(f'<h1>{line[2:]}</h1>')
        elif line.startswith('## '):
            html_lines.append(f'<h2>{line[3:]}</h2>')
        elif line.startswith('### '):
            html_lines.append(f'<h3>{line[4:]}</h3>')
        # Lists
        elif line.startswith('- '):
            if not in_list:
                html_lines.append('<ul>')
                in_list = True
            html_lines.append(f'<li>{line[2:]}</li>')
        else:
            if in_list:
                html_lines.append('</ul>')
                in_list = False
            # Horizontal rule
            if line.strip() == '---':
                html_lines.append('<hr>')
            # Bold
            elif '**' in line:
                line = line.replace('**', '<strong>', 1).replace('**', '</strong>', 1)
                html_lines.append(f'<p>{line}</p>')
            # Regular paragraph
            elif line.strip():
                html_lines.append(f'<p>{line}</p>')
            else:
                html_lines.append('<br>')
    if in_list:
        html_lines.append('</ul>')
    return '\n'.join(html_lines)
 def generate_report(
    conversation_data: Dict[str, Any],
    output_path: Path,
    format: str = 'markdown',
    title: Optional[str] = None
 ):
    """
    Generate formatted report from conversation data.
    Args:
        conversation_data: Conversation history dictionary
        output_path: Output file path
        format: Output format ('markdown', 'html', or 'pdf')
        title: Report title
    """
    messages = conversation_data.get('messages', [])
    if not title:
        title = f"Biomni Analysis - {datetime.now().strftime('%Y-%m-%d')}"
    # Generate markdown
-    md_path = output_path.replace('.pdf', '.md')
+    markdown_content = format_conversation_history(messages)
-    generate_markdown_report(
+
-        title="Biomni Analysis Report",
+    if format == 'markdown':
-        sections=sections,
+        output_path.write_text(markdown_content)
-        metadata=metadata,
+        print(f"✓ Markdown report saved to {output_path}")
-        output_path=md_path
+
-    )
+    elif format == 'html':
        html_content = markdown_to_html(markdown_content, title)
        output_path.write_text(html_content)
        print(f"✓ HTML report saved to {output_path}")
    elif format == 'pdf':
        # For PDF generation, we'd typically use a library like weasyprint or reportlab
        # This is a placeholder implementation
        print("PDF generation requires additional dependencies (weasyprint or reportlab)")
        print("Falling back to HTML format...")
        html_path = output_path.with_suffix('.html')
        html_content = markdown_to_html(markdown_content, title)
        html_path.write_text(html_content)
        print(f"✓ HTML report saved to {html_path}")
        print("  To convert to PDF:")
        print(f"    1. Install weasyprint: pip install weasyprint")
        print(f"    2. Run: weasyprint {html_path} {output_path}")
    # Convert to PDF
    if method == 'weasyprint':
        success = convert_to_pdf_weasyprint(md_path, output_path)
    elif method == 'pandoc':
        success = convert_to_pdf_pandoc(md_path, output_path)
    else:
-        print(f"Unknown method: {method}")
+        raise ValueError(f"Unsupported format: {format}")
        return False
    if success:
        print(f"✓ Report generated: {output_path}")
        print(f"  Markdown: {md_path}")
    else:
        print("✗ Failed to generate PDF")
        print(f"  Markdown available: {md_path}")
    return success
 def main():
-    """CLI for report generation."""
+    """Main entry point for CLI usage."""
    parser = argparse.ArgumentParser(
-        description='Generate formatted PDF reports for Biomni analyses'
+        description="Generate enhanced reports from biomni conversation histories"
    )
    parser.add_argument(
-        'input',
+        '--input',
-        type=str,
+        type=Path,
-        help='Input markdown file or conversation history'
+        required=True,
        help='Input conversation history JSON file'
    )
    parser.add_argument(
-        '-o', '--output',
+        '--output',
-        type=str,
+        type=Path,
-        default='biomni_report.pdf',
+        required=True,
-        help='Output PDF path (default: biomni_report.pdf)'
+        help='Output report file path'
    )
    parser.add_argument(
-        '-m', '--method',
+        '--format',
-        type=str,
+        choices=['markdown', 'html', 'pdf'],
-        choices=['weasyprint', 'pandoc'],
+        default='markdown',
-        default='weasyprint',
+        help='Output format (default: markdown)'
        help='Conversion method (default: weasyprint)'
    )
    parser.add_argument(
-        '--css',
+        '--title',
        type=str,
-        help='Custom CSS stylesheet path'
+        help='Report title (optional)'
    )
    args = parser.parse_args()
-    # Check if input is markdown or conversation history
+    # Load conversation data
    input_path = Path(args.input)
    if not input_path.exists():
        print(f"Error: Input file not found: {args.input}")
        return 1
    # If input is markdown, convert directly
    if input_path.suffix == '.md':
        if args.method == 'weasyprint':
            success = convert_to_pdf_weasyprint(
                str(input_path),
                args.output,
                args.css
            )
        else:
            success = convert_to_pdf_pandoc(str(input_path), args.output)
        return 0 if success else 1
    # Otherwise, assume it's conversation history (JSON)
    try:
-        import json
+        with open(args.input, 'r') as f:
-        with open(input_path) as f:
+            conversation_data = json.load(f)
-            history = json.load(f)
+    except FileNotFoundError:
-
+        print(f"❌ Input file not found: {args.input}")
-        success = create_biomni_report(
+        return 1
            history,
            args.output,
            args.method
        )
        return 0 if success else 1
    except json.JSONDecodeError:
-        print("Error: Input file is not valid JSON or markdown")
+        print(f"❌ Invalid JSON in input file: {args.input}")
        return 1
    # Generate report
    try:
        generate_report(
            conversation_data,
            args.output,
            format=args.format,
            title=args.title
        )
        return 0
    except Exception as e:
        print(f"❌ Error generating report: {e}")
        return 1
-if __name__ == "__main__":
+if __name__ == '__main__':
    import sys
    sys.exit(main())
--- a/scientific-packages/biomni/scripts/setup_environment.py
+++ b/scientific-packages/biomni/scripts/setup_environment.py
@@ -1,230 +1,355 @@
 #!/usr/bin/env python3
 """
-Biomni Environment Setup and Validation Script
+Interactive setup script for biomni environment configuration.
-This script helps users set up and validate their Biomni environment,
+This script helps users set up:
-including checking dependencies, API keys, and data availability.
+1. Conda environment with required dependencies
 2. API keys for LLM providers
 3. Data lake directory configuration
 4. MCP server setup (optional)
 Usage:
    python setup_environment.py
 """
 import os
 import sys
 import subprocess
 from pathlib import Path
-from typing import Dict, List, Tuple
+from typing import Dict, Optional
-def check_python_version() -> Tuple[bool, str]:
+def check_conda_installed() -> bool:
-    """Check if Python version is compatible."""
+    """Check if conda is available in the system."""
    version = sys.version_info
    if version.major == 3 and version.minor >= 8:
        return True, f"Python {version.major}.{version.minor}.{version.micro} ✓"
    else:
        return False, f"Python {version.major}.{version.minor} - requires Python 3.8+"
 def check_conda_env() -> Tuple[bool, str]:
    """Check if running in biomni conda environment."""
    conda_env = os.environ.get('CONDA_DEFAULT_ENV', None)
    if conda_env == 'biomni_e1':
        return True, f"Conda environment: {conda_env} ✓"
    else:
        return False, f"Not in biomni_e1 environment (current: {conda_env})"
 def check_package_installed(package: str) -> bool:
    """Check if a Python package is installed."""
    try:
-        __import__(package)
+        subprocess.run(
            ['conda', '--version'],
            capture_output=True,
            check=True
        )
        return True
-    except ImportError:
+    except (subprocess.CalledProcessError, FileNotFoundError):
        return False
-def check_dependencies() -> Tuple[bool, List[str]]:
+def setup_conda_environment():
-    """Check for required and optional dependencies."""
+    """Guide user through conda environment setup."""
-    required = ['biomni']
+    print("\n=== Conda Environment Setup ===")
    optional = ['weasyprint', 'markdown2pdf']
-    missing_required = [pkg for pkg in required if not check_package_installed(pkg)]
+    if not check_conda_installed():
-    missing_optional = [pkg for pkg in optional if not check_package_installed(pkg)]
+        print("❌ Conda not found. Please install Miniconda or Anaconda:")
        print("   https://docs.conda.io/en/latest/miniconda.html")
        return False
-    messages = []
+    print("✓ Conda is installed")
    success = len(missing_required) == 0
-    if missing_required:
+    # Check if biomni_e1 environment exists
-        messages.append(f"Missing required packages: {', '.join(missing_required)}")
+    result = subprocess.run(
-        messages.append("Install with: pip install biomni --upgrade")
+        ['conda', 'env', 'list'],
        capture_output=True,
        text=True
    )
    if 'biomni_e1' in result.stdout:
        print("✓ biomni_e1 environment already exists")
        return True
    print("\nCreating biomni_e1 conda environment...")
    print("This will install Python 3.10 and required dependencies.")
    response = input("Proceed? [y/N]: ").strip().lower()
    if response != 'y':
        print("Skipping conda environment setup")
        return False
    try:
        # Create conda environment
        subprocess.run(
            ['conda', 'create', '-n', 'biomni_e1', 'python=3.10', '-y'],
            check=True
        )
        print("\n✓ Conda environment created successfully")
        print("\nTo activate: conda activate biomni_e1")
        print("Then install biomni: pip install biomni --upgrade")
        return True
    except subprocess.CalledProcessError as e:
        print(f"❌ Failed to create conda environment: {e}")
        return False
 def setup_api_keys() -> Dict[str, str]:
    """Interactive API key configuration."""
    print("\n=== API Key Configuration ===")
    print("Biomni supports multiple LLM providers.")
    print("At minimum, configure one provider.")
    api_keys = {}
    # Anthropic (recommended)
    print("\n1. Anthropic Claude (Recommended)")
    print("   Get your API key from: https://console.anthropic.com/")
    anthropic_key = input("   Enter ANTHROPIC_API_KEY (or press Enter to skip): ").strip()
    if anthropic_key:
        api_keys['ANTHROPIC_API_KEY'] = anthropic_key
    # OpenAI
    print("\n2. OpenAI")
    print("   Get your API key from: https://platform.openai.com/api-keys")
    openai_key = input("   Enter OPENAI_API_KEY (or press Enter to skip): ").strip()
    if openai_key:
        api_keys['OPENAI_API_KEY'] = openai_key
    # Google Gemini
    print("\n3. Google Gemini")
    print("   Get your API key from: https://makersuite.google.com/app/apikey")
    google_key = input("   Enter GOOGLE_API_KEY (or press Enter to skip): ").strip()
    if google_key:
        api_keys['GOOGLE_API_KEY'] = google_key
    # Groq
    print("\n4. Groq")
    print("   Get your API key from: https://console.groq.com/keys")
    groq_key = input("   Enter GROQ_API_KEY (or press Enter to skip): ").strip()
    if groq_key:
        api_keys['GROQ_API_KEY'] = groq_key
    if not api_keys:
        print("\n⚠️  No API keys configured. You'll need at least one to use biomni.")
        return {}
    return api_keys
 def save_api_keys(api_keys: Dict[str, str], method: str = 'env_file'):
    """Save API keys using specified method."""
    if method == 'env_file':
        env_file = Path.cwd() / '.env'
        # Read existing .env if present
        existing_vars = {}
        if env_file.exists():
            with open(env_file, 'r') as f:
                for line in f:
                    line = line.strip()
                    if line and not line.startswith('#'):
                        if '=' in line:
                            key, val = line.split('=', 1)
                            existing_vars[key.strip()] = val.strip()
        # Update with new keys
        existing_vars.update(api_keys)
        # Write to .env
        with open(env_file, 'w') as f:
            f.write("# Biomni API Keys\n")
            f.write(f"# Generated by setup_environment.py\n\n")
            for key, value in existing_vars.items():
                f.write(f"{key}={value}\n")
        print(f"\n✓ API keys saved to {env_file}")
        print("  Keys will be loaded automatically when biomni runs in this directory")
    elif method == 'shell_export':
        shell_file = Path.home() / '.bashrc'  # or .zshrc for zsh users
        print("\n📋 Add these lines to your shell configuration:")
        for key, value in api_keys.items():
            print(f"   export {key}=\"{value}\"")
        print(f"\nThen run: source {shell_file}")
 def setup_data_directory() -> Optional[Path]:
    """Configure biomni data lake directory."""
    print("\n=== Data Lake Configuration ===")
    print("Biomni requires ~11GB for integrated biomedical databases.")
    default_path = Path.cwd() / 'biomni_data'
    print(f"\nDefault location: {default_path}")
    response = input("Use default location? [Y/n]: ").strip().lower()
    if response == 'n':
        custom_path = input("Enter custom path: ").strip()
        data_path = Path(custom_path).expanduser().resolve()
    else:
-        messages.append("Required packages: ✓")
+        data_path = default_path
-    if missing_optional:
+    # Create directory if it doesn't exist
-        messages.append(f"Missing optional packages: {', '.join(missing_optional)}")
+    data_path.mkdir(parents=True, exist_ok=True)
        messages.append("For PDF reports, install: pip install weasyprint")
-    return success, messages
+    print(f"\n✓ Data directory configured: {data_path}")
    print("  Data will be downloaded automatically on first use")
    return data_path
-def check_api_keys() -> Tuple[bool, Dict[str, bool]]:
+def test_installation(data_path: Path):
-    """Check which API keys are configured."""
+    """Test biomni installation with a simple query."""
-    api_keys = {
+    print("\n=== Installation Test ===")
-        'ANTHROPIC_API_KEY': os.environ.get('ANTHROPIC_API_KEY'),
+    print("Testing biomni installation with a simple query...")
        'OPENAI_API_KEY': os.environ.get('OPENAI_API_KEY'),
        'GEMINI_API_KEY': os.environ.get('GEMINI_API_KEY'),
        'GROQ_API_KEY': os.environ.get('GROQ_API_KEY'),
    }
-    configured = {key: bool(value) for key, value in api_keys.items()}
+    response = input("Run test? [Y/n]: ").strip().lower()
-    has_any = any(configured.values())
+    if response == 'n':
        print("Skipping test")
        return
-    return has_any, configured
+    test_code = f'''
 import os
 from biomni.agent import A1
 # Use environment variables for API keys
 agent = A1(path='{data_path}', llm='claude-sonnet-4-20250514')
-def check_data_directory(data_path: str = './data') -> Tuple[bool, str]:
+# Simple test query
-    """Check if Biomni data directory exists and has content."""
+result = agent.go("What is the primary function of the TP53 gene?")
-    path = Path(data_path)
+print("Test result:", result)
 '''
-    if not path.exists():
+    test_file = Path('test_biomni.py')
-        return False, f"Data directory not found at {data_path}"
+    with open(test_file, 'w') as f:
        f.write(test_code)
-    # Check if directory has files (data has been downloaded)
+    print(f"\nTest script created: {test_file}")
-    files = list(path.glob('*'))
+    print("Running test...")
    if len(files) == 0:
        return False, f"Data directory exists but is empty. Run agent once to download."
    # Rough size check (should be ~11GB)
    total_size = sum(f.stat().st_size for f in path.rglob('*') if f.is_file())
    size_gb = total_size / (1024**3)
    if size_gb < 1:
        return False, f"Data directory exists but seems incomplete ({size_gb:.1f} GB)"
    return True, f"Data directory: {data_path} ({size_gb:.1f} GB) ✓"
 def check_disk_space(required_gb: float = 20) -> Tuple[bool, str]:
    """Check if sufficient disk space is available."""
    try:
-        import shutil
+        subprocess.run([sys.executable, str(test_file)], check=True)
-        stat = shutil.disk_usage('.')
+        print("\n✓ Test completed successfully!")
-        free_gb = stat.free / (1024**3)
+        test_file.unlink()  # Clean up test file
-
+    except subprocess.CalledProcessError:
-        if free_gb >= required_gb:
+        print("\n❌ Test failed. Check your configuration.")
-            return True, f"Disk space: {free_gb:.1f} GB available ✓"
+        print(f"   Test script saved as {test_file} for debugging")
        else:
            return False, f"Low disk space: {free_gb:.1f} GB (need {required_gb} GB)"
    except Exception as e:
        return False, f"Could not check disk space: {e}"
-def test_biomni_import() -> Tuple[bool, str]:
+def generate_example_script(data_path: Path):
-    """Test if Biomni can be imported and initialized."""
+    """Generate example usage script."""
-    try:
+    example_code = f'''#!/usr/bin/env python3
-        from biomni.agent import A1
+"""
-        from biomni.config import default_config
+Example biomni usage script
        return True, "Biomni import successful ✓"
    except ImportError as e:
        return False, f"Cannot import Biomni: {e}"
    except Exception as e:
        return False, f"Biomni import error: {e}"
 This demonstrates basic biomni usage patterns.
 Modify this script for your research tasks.
 """
-def suggest_fixes(results: Dict[str, Tuple[bool, any]]) -> List[str]:
+from biomni.agent import A1
    """Generate suggestions for fixing issues."""
    suggestions = []
-    if not results['python'][0]:
+# Initialize agent
-        suggestions.append("➜ Upgrade Python to 3.8 or higher")
+agent = A1(
    path='{data_path}',
    llm='claude-sonnet-4-20250514'  # or your preferred LLM
 )
-    if not results['conda'][0]:
+# Example 1: Simple gene query
-        suggestions.append("➜ Activate biomni environment: conda activate biomni_e1")
+print("Example 1: Gene function query")
 result = agent.go("""
 What are the main functions of the BRCA1 gene?
 Include information about:
 - Molecular function
 - Associated diseases
 - Protein interactions
 """)
 print(result)
 print("-" * 80)
-    if not results['dependencies'][0]:
+# Example 2: Data analysis
-        suggestions.append("➜ Install Biomni: pip install biomni --upgrade")
+print("\\nExample 2: GWAS analysis")
 result = agent.go("""
 Explain how to analyze GWAS summary statistics for:
 1. Identifying genome-wide significant variants
 2. Mapping variants to genes
 3. Pathway enrichment analysis
 """)
 print(result)
-    if not results['api_keys'][0]:
+# Save conversation history
-        suggestions.append("➜ Set API key: export ANTHROPIC_API_KEY='your-key'")
+agent.save_conversation_history("example_results.pdf")
-        suggestions.append("   Or create .env file with API keys")
+print("\\nResults saved to example_results.pdf")
 '''
-    if not results['data'][0]:
+    example_file = Path('example_biomni_usage.py')
-        suggestions.append("➜ Data will auto-download on first agent.go() call")
+    with open(example_file, 'w') as f:
        f.write(example_code)
-    if not results['disk_space'][0]:
+    print(f"\n✓ Example script created: {example_file}")
        suggestions.append("➜ Free up disk space (need ~20GB total)")
    return suggestions
 def main():
-    """Run all environment checks and display results."""
+    """Main setup workflow."""
    print("=" * 60)
-    print("Biomni Environment Validation")
+    print("Biomni Environment Setup")
    print("=" * 60)
    print()
-    # Run all checks
+    # Step 1: Conda environment
-    results = {}
+    conda_success = setup_conda_environment()
-    print("Checking Python version...")
+    if conda_success:
-    results['python'] = check_python_version()
+        print("\n⚠️  Remember to activate the environment:")
-    print(f"  {results['python'][1]}")
+        print("   conda activate biomni_e1")
-    print()
+        print("   pip install biomni --upgrade")
-    print("Checking conda environment...")
+    # Step 2: API keys
-    results['conda'] = check_conda_env()
+    api_keys = setup_api_keys()
    print(f"  {results['conda'][1]}")
    print()
-    print("Checking dependencies...")
+    if api_keys:
-    results['dependencies'] = check_dependencies()
+        print("\nHow would you like to store API keys?")
-    for msg in results['dependencies'][1]:
+        print("1. .env file (recommended, local to this directory)")
-        print(f"  {msg}")
+        print("2. Shell export (add to .bashrc/.zshrc)")
    print()
-    print("Checking API keys...")
+        choice = input("Choose [1/2]: ").strip()
    results['api_keys'] = check_api_keys()
    has_keys, key_status = results['api_keys']
    for key, configured in key_status.items():
        status = "✓" if configured else "✗"
        print(f"  {key}: {status}")
    print()
-    print("Checking Biomni data directory...")
+        if choice == '2':
-    results['data'] = check_data_directory()
+            save_api_keys(api_keys, method='shell_export')
-    print(f"  {results['data'][1]}")
+        else:
-    print()
+            save_api_keys(api_keys, method='env_file')
-    print("Checking disk space...")
+    # Step 3: Data directory
-    results['disk_space'] = check_disk_space()
+    data_path = setup_data_directory()
    print(f"  {results['disk_space'][1]}")
    print()
-    print("Testing Biomni import...")
+    # Step 4: Generate example script
-    results['biomni_import'] = test_biomni_import()
+    if data_path:
-    print(f"  {results['biomni_import'][1]}")
+        generate_example_script(data_path)
-    print()
+
    # Step 5: Test installation (optional)
    if api_keys and data_path:
        test_installation(data_path)
    # Summary
    print("\n" + "=" * 60)
    print("Setup Complete!")
    print("=" * 60)
    all_passed = all(result[0] for result in results.values())
-    if all_passed:
+    if conda_success:
-        print("✓ All checks passed! Environment is ready.")
+        print("✓ Conda environment: biomni_e1")
-        print()
+
-        print("Quick start:")
+    if api_keys:
-        print("  from biomni.agent import A1")
+        print(f"✓ API keys configured: {', '.join(api_keys.keys())}")
-        print("  agent = A1(path='./data', llm='claude-sonnet-4-20250514')")
+
-        print("  agent.go('Your biomedical task')")
+    if data_path:
        print(f"✓ Data directory: {data_path}")
    print("\nNext steps:")
    if conda_success:
        print("1. conda activate biomni_e1")
        print("2. pip install biomni --upgrade")
        print("3. Run example_biomni_usage.py to test")
    else:
-        print("⚠ Some checks failed. See suggestions below:")
+        print("1. Install conda/miniconda")
-        print()
+        print("2. Run this script again")
        suggestions = suggest_fixes(results)
        for suggestion in suggestions:
            print(suggestion)
-    print("=" * 60)
+    print("\nFor documentation, see:")
-
+    print("  - GitHub: https://github.com/snap-stanford/biomni")
-    return 0 if all_passed else 1
+    print("  - Paper: https://www.biorxiv.org/content/10.1101/2025.05.30.656746v1")
 if __name__ == "__main__":
-    sys.exit(main())
+    try:
        main()
    except KeyboardInterrupt:
        print("\n\nSetup interrupted by user")
        sys.exit(1)
    except Exception as e:
        print(f"\n❌ Error during setup: {e}")
        sys.exit(1)