Consolidate skills

2026-01-26 16:58:56 +08:00 · 2025-11-13 18:50:42 -08:00
parent cc99fdb57d
commit 8be6c6c307
786 changed files with 0 additions and 153 deletions
--- a/scientific-skills/reactome-database/reactome-database/SKILL.md
+++ b/scientific-skills/reactome-database/reactome-database/SKILL.md
@@ -0,0 +1,272 @@
+---
+name: reactome-database
+description: "Query Reactome REST API for pathway analysis, enrichment, gene-pathway mapping, disease pathways, molecular interactions, expression analysis, for systems biology studies."
+---
+
+# Reactome Database
+
+## Overview
+
+Reactome is a free, open-source, curated pathway database with 2,825+ human pathways. Query biological pathways, perform overrepresentation and expression analysis, map genes to pathways, explore molecular interactions via REST API and Python client for systems biology research.
+
+## When to Use This Skill
+
+This skill should be used when:
+- Performing pathway enrichment analysis on gene or protein lists
+- Analyzing gene expression data to identify relevant biological pathways
+- Querying specific pathway information, reactions, or molecular interactions
+- Mapping genes or proteins to biological pathways and processes
+- Exploring disease-related pathways and mechanisms
+- Visualizing analysis results in the Reactome Pathway Browser
+- Conducting comparative pathway analysis across species
+
+## Core Capabilities
+
+Reactome provides two main API services and a Python client library:
+
+### 1. Content Service - Data Retrieval
+
+Query and retrieve biological pathway data, molecular interactions, and entity information.
+
+**Common operations:**
+- Retrieve pathway information and hierarchies
+- Query specific entities (proteins, reactions, complexes)
+- Get participating molecules in pathways
+- Access database version and metadata
+- Explore pathway compartments and locations
+
+**API Base URL:** `https://reactome.org/ContentService`
+
+### 2. Analysis Service - Pathway Analysis
+
+Perform computational analysis on gene lists and expression data.
+
+**Analysis types:**
+- **Overrepresentation Analysis**: Identify statistically significant pathways from gene/protein lists
+- **Expression Data Analysis**: Analyze gene expression datasets to find relevant pathways
+- **Species Comparison**: Compare pathway data across different organisms
+
+**API Base URL:** `https://reactome.org/AnalysisService`
+
+### 3. reactome2py Python Package
+
+Python client library that wraps Reactome API calls for easier programmatic access.
+
+**Installation:**
+```bash
+pip install reactome2py
+```
+
+**Note:** The reactome2py package (version 3.0.0, released January 2021) is functional but not actively maintained. For the most up-to-date functionality, consider using direct REST API calls.
+
+## Querying Pathway Data
+
+### Using Content Service REST API
+
+The Content Service uses REST protocol and returns data in JSON or plain text formats.
+
+**Get database version:**
+```python
+import requests
+
+response = requests.get("https://reactome.org/ContentService/data/database/version")
+version = response.text
+print(f"Reactome version: {version}")
+```
+
+**Query a specific entity:**
+```python
+import requests
+
+entity_id = "R-HSA-69278"  # Example pathway ID
+response = requests.get(f"https://reactome.org/ContentService/data/query/{entity_id}")
+data = response.json()
+```
+
+**Get participating molecules in a pathway:**
+```python
+import requests
+
+event_id = "R-HSA-69278"
+response = requests.get(
+    f"https://reactome.org/ContentService/data/event/{event_id}/participatingPhysicalEntities"
+)
+molecules = response.json()
+```
+
+### Using reactome2py Package
+
+```python
+import reactome2py
+from reactome2py import content
+
+# Query pathway information
+pathway_info = content.query_by_id("R-HSA-69278")
+
+# Get database version
+version = content.get_database_version()
+```
+
+**For detailed API endpoints and parameters**, refer to `references/api_reference.md` in this skill.
+
+## Performing Pathway Analysis
+
+### Overrepresentation Analysis
+
+Submit a list of gene/protein identifiers to find enriched pathways.
+
+**Using REST API:**
+```python
+import requests
+
+# Prepare identifier list
+identifiers = ["TP53", "BRCA1", "EGFR", "MYC"]
+data = "\n".join(identifiers)
+
+# Submit analysis
+response = requests.post(
+    "https://reactome.org/AnalysisService/identifiers/",
+    headers={"Content-Type": "text/plain"},
+    data=data
+)
+
+result = response.json()
+token = result["summary"]["token"]  # Save token to retrieve results later
+
+# Access pathways
+for pathway in result["pathways"]:
+    print(f"{pathway['stId']}: {pathway['name']} (p-value: {pathway['entities']['pValue']})")
+```
+
+**Retrieve analysis by token:**
+```python
+# Token is valid for 7 days
+response = requests.get(f"https://reactome.org/AnalysisService/token/{token}")
+results = response.json()
+```
+
+### Expression Data Analysis
+
+Analyze gene expression datasets with quantitative values.
+
+**Input format (TSV with header starting with #):**
+```
+#Gene	Sample1	Sample2	Sample3
+TP53	2.5	3.1	2.8
+BRCA1	1.2	1.5	1.3
+EGFR	4.5	4.2	4.8
+```
+
+**Submit expression data:**
+```python
+import requests
+
+# Read TSV file
+with open("expression_data.tsv", "r") as f:
+    data = f.read()
+
+response = requests.post(
+    "https://reactome.org/AnalysisService/identifiers/",
+    headers={"Content-Type": "text/plain"},
+    data=data
+)
+
+result = response.json()
+```
+
+### Species Projection
+
+Map identifiers to human pathways exclusively using the `/projection/` endpoint:
+
+```python
+response = requests.post(
+    "https://reactome.org/AnalysisService/identifiers/projection/",
+    headers={"Content-Type": "text/plain"},
+    data=data
+)
+```
+
+## Visualizing Results
+
+Analysis results can be visualized in the Reactome Pathway Browser by constructing URLs with the analysis token:
+
+```python
+token = result["summary"]["token"]
+pathway_id = "R-HSA-69278"
+url = f"https://reactome.org/PathwayBrowser/#{pathway_id}&DTAB=AN&ANALYSIS={token}"
+print(f"View results: {url}")
+```
+
+## Working with Analysis Tokens
+
+- Analysis tokens are valid for **7 days**
+- Tokens allow retrieval of previously computed results without re-submission
+- Store tokens to access results across sessions
+- Use `GET /token/{TOKEN}` endpoint to retrieve results
+
+## Data Formats and Identifiers
+
+### Supported Identifier Types
+
+Reactome accepts various identifier formats:
+- UniProt accessions (e.g., P04637)
+- Gene symbols (e.g., TP53)
+- Ensembl IDs (e.g., ENSG00000141510)
+- EntrezGene IDs (e.g., 7157)
+- ChEBI IDs for small molecules
+
+The system automatically detects identifier types.
+
+### Input Format Requirements
+
+**For overrepresentation analysis:**
+- Plain text list of identifiers (one per line)
+- OR single column in TSV format
+
+**For expression analysis:**
+- TSV format with mandatory header row starting with "#"
+- Column 1: identifiers
+- Columns 2+: numeric expression values
+- Use period (.) as decimal separator
+
+### Output Format
+
+All API responses return JSON containing:
+- `pathways`: Array of enriched pathways with statistical metrics
+- `summary`: Analysis metadata and token
+- `entities`: Matched and unmapped identifiers
+- Statistical values: pValue, FDR (false discovery rate)
+
+## Helper Scripts
+
+This skill includes `scripts/reactome_query.py`, a helper script for common Reactome operations:
+
+```bash
+# Query pathway information
+python scripts/reactome_query.py query R-HSA-69278
+
+# Perform overrepresentation analysis
+python scripts/reactome_query.py analyze gene_list.txt
+
+# Get database version
+python scripts/reactome_query.py version
+```
+
+## Additional Resources
+
+- **API Documentation**: https://reactome.org/dev
+- **User Guide**: https://reactome.org/userguide
+- **Documentation Portal**: https://reactome.org/documentation
+- **Data Downloads**: https://reactome.org/download-data
+- **reactome2py Docs**: https://reactome.github.io/reactome2py/
+
+For comprehensive API endpoint documentation, see `references/api_reference.md` in this skill.
+
+## Current Database Statistics (Version 94, September 2025)
+
+- 2,825 human pathways
+- 16,002 reactions
+- 11,630 proteins
+- 2,176 small molecules
+- 1,070 drugs
+- 41,373 literature references
--- a/scientific-skills/reactome-database/reactome-database/references/api_reference.md
+++ b/scientific-skills/reactome-database/reactome-database/references/api_reference.md
@@ -0,0 +1,465 @@
+# Reactome API Reference
+
+This document provides comprehensive reference information for Reactome's REST APIs.
+
+## Base URLs
+
+- **Content Service**: `https://reactome.org/ContentService`
+- **Analysis Service**: `https://reactome.org/AnalysisService`
+
+## Content Service API
+
+The Content Service provides access to Reactome's curated pathway data through REST endpoints.
+
+### Database Information
+
+#### Get Database Version
+```
+GET /data/database/version
+```
+
+**Response:** Plain text containing the database version number
+
+**Example:**
+```python
+import requests
+response = requests.get("https://reactome.org/ContentService/data/database/version")
+print(response.text)  # e.g., "94"
+```
+
+#### Get Database Name
+```
+GET /data/database/name
+```
+
+**Response:** Plain text containing the database name
+
+### Entity Queries
+
+#### Query Entity by ID
+```
+GET /data/query/{id}
+```
+
+**Parameters:**
+- `id` (path): Stable identifier or database ID (e.g., "R-HSA-69278")
+
+**Response:** JSON object containing full entity information including:
+- `stId`: Stable identifier
+- `displayName`: Human-readable name
+- `schemaClass`: Entity type (Pathway, Reaction, Complex, etc.)
+- `species`: Array of species information
+- Additional type-specific fields
+
+**Example:**
+```python
+import requests
+response = requests.get("https://reactome.org/ContentService/data/query/R-HSA-69278")
+pathway = response.json()
+print(f"Pathway: {pathway['displayName']}")
+print(f"Species: {pathway['species'][0]['displayName']}")
+```
+
+#### Query Entity Attribute
+```
+GET /data/query/{id}/{attribute}
+```
+
+**Parameters:**
+- `id` (path): Entity identifier
+- `attribute` (path): Specific attribute name (e.g., "displayName", "compartment")
+
+**Response:** JSON or plain text depending on attribute type
+
+**Example:**
+```python
+response = requests.get("https://reactome.org/ContentService/data/query/R-HSA-69278/displayName")
+name = response.text
+```
+
+### Pathway Queries
+
+#### Get Pathway Entities
+```
+GET /data/event/{id}/participatingPhysicalEntities
+```
+
+**Parameters:**
+- `id` (path): Pathway or reaction stable identifier
+
+**Response:** JSON array of physical entities (proteins, complexes, small molecules) participating in the pathway
+
+**Example:**
+```python
+response = requests.get(
+    "https://reactome.org/ContentService/data/event/R-HSA-69278/participatingPhysicalEntities"
+)
+entities = response.json()
+for entity in entities:
+    print(f"{entity['stId']}: {entity['displayName']} ({entity['schemaClass']})")
+```
+
+#### Get Contained Events
+```
+GET /data/pathway/{id}/containedEvents
+```
+
+**Parameters:**
+- `id` (path): Pathway stable identifier
+
+**Response:** JSON array of events (reactions, subpathways) contained within the pathway
+
+### Search Queries
+
+#### Search by Name
+```
+GET /data/query?name={query}
+```
+
+**Parameters:**
+- `name` (query): Search term
+
+**Response:** JSON array of matching entities
+
+**Example:**
+```python
+response = requests.get(
+    "https://reactome.org/ContentService/data/query",
+    params={"name": "glycolysis"}
+)
+results = response.json()
+```
+
+## Analysis Service API
+
+The Analysis Service performs pathway enrichment and expression analysis.
+
+### Submit Analysis
+
+#### Submit Identifiers (POST)
+```
+POST /identifiers/
+POST /identifiers/projection/  # Map to human pathways only
+```
+
+**Headers:**
+- `Content-Type: text/plain`
+
+**Body:**
+- For overrepresentation: Plain text list of identifiers (one per line)
+- For expression analysis: TSV format with header starting with "#"
+
+**Expression data format:**
+```
+#Gene	Sample1	Sample2	Sample3
+TP53	2.5	3.1	2.8
+BRCA1	1.2	1.5	1.3
+```
+
+**Response:** JSON object containing:
+```json
+{
+  "summary": {
+    "token": "MzUxODM3NTQzMDAwMDA1ODI4MA==",
+    "type": "OVERREPRESENTATION",
+    "species": "9606",
+    "sampleName": null,
+    "fileName": null,
+    "text": true
+  },
+  "pathways": [
+    {
+      "stId": "R-HSA-69278",
+      "name": "Cell Cycle, Mitotic",
+      "species": {
+        "name": "Homo sapiens",
+        "taxId": "9606"
+      },
+      "entities": {
+        "found": 15,
+        "total": 450,
+        "pValue": 0.0000234,
+        "fdr": 0.00156
+      },
+      "reactions": {
+        "found": 12,
+        "total": 342
+      }
+    }
+  ],
+  "resourceSummary": [
+    {
+      "resource": "TOTAL",
+      "pathways": 25
+    }
+  ]
+}
+```
+
+**Example:**
+```python
+import requests
+
+# Overrepresentation analysis
+identifiers = ["TP53", "BRCA1", "EGFR", "MYC", "CDK1"]
+data = "\n".join(identifiers)
+
+response = requests.post(
+    "https://reactome.org/AnalysisService/identifiers/",
+    headers={"Content-Type": "text/plain"},
+    data=data
+)
+
+result = response.json()
+token = result["summary"]["token"]
+
+# Process pathways
+for pathway in result["pathways"]:
+    print(f"Pathway: {pathway['name']}")
+    print(f"  Found: {pathway['entities']['found']}/{pathway['entities']['total']}")
+    print(f"  p-value: {pathway['entities']['pValue']:.6f}")
+    print(f"  FDR: {pathway['entities']['fdr']:.6f}")
+```
+
+#### Submit File (Form Upload)
+```
+POST /identifiers/form/
+```
+
+**Content-Type:** `multipart/form-data`
+
+**Parameters:**
+- `file`: File containing identifiers or expression data
+
+#### Submit URL
+```
+POST /identifiers/url/
+```
+
+**Parameters:**
+- `url`: URL pointing to data file
+
+### Retrieve Analysis Results
+
+#### Get Results by Token
+```
+GET /token/{token}
+GET /token/{token}/projection/  # With species projection
+```
+
+**Parameters:**
+- `token` (path): Analysis token returned from submission
+
+**Response:** Same structure as initial analysis response
+
+**Example:**
+```python
+token = "MzUxODM3NTQzMDAwMDA1ODI4MA=="
+response = requests.get(f"https://reactome.org/AnalysisService/token/{token}")
+results = response.json()
+```
+
+**Note:** Tokens are valid for 7 days
+
+#### Filter Results
+```
+GET /token/{token}/filter/pathways?resource={resource}
+```
+
+**Parameters:**
+- `token` (path): Analysis token
+- `resource` (query): Resource filter (e.g., "TOTAL", "UNIPROT", "ENSEMBL")
+
+### Download Results
+
+#### Download as CSV
+```
+GET /download/{token}/pathways/{resource}/result.csv
+```
+
+#### Download Mapping
+```
+GET /download/{token}/entities/found/{resource}/mapping.tsv
+```
+
+## Supported Identifiers
+
+Reactome automatically detects and processes various identifier types:
+
+### Proteins and Genes
+- **UniProt**: P04637
+- **Gene Symbol**: TP53
+- **Ensembl**: ENSG00000141510
+- **EntrezGene**: 7157
+- **RefSeq**: NM_000546
+- **OMIM**: 191170
+
+### Small Molecules
+- **ChEBI**: CHEBI:15377
+- **KEGG Compound**: C00031
+- **PubChem**: 702
+
+### Other
+- **miRBase**: hsa-miR-21
+- **InterPro**: IPR011616
+
+## Response Formats
+
+### JSON Objects
+
+Entity objects contain standardized fields:
+```json
+{
+  "stId": "R-HSA-69278",
+  "displayName": "Cell Cycle, Mitotic",
+  "schemaClass": "Pathway",
+  "species": [
+    {
+      "dbId": 48887,
+      "displayName": "Homo sapiens",
+      "taxId": "9606"
+    }
+  ],
+  "isInDisease": false
+}
+```
+
+### TSV Format
+
+For bulk queries, TSV returns:
+```
+stId	displayName	schemaClass
+R-HSA-69278	Cell Cycle, Mitotic	Pathway
+R-HSA-69306	DNA Replication	Pathway
+```
+
+## Error Responses
+
+### HTTP Status Codes
+- `200`: Success
+- `400`: Bad Request (invalid parameters)
+- `404`: Not Found (invalid ID)
+- `415`: Unsupported Media Type
+- `500`: Internal Server Error
+
+### Error JSON Structure
+```json
+{
+  "code": 404,
+  "reason": "NOT_FOUND",
+  "messages": ["Pathway R-HSA-INVALID not found"]
+}
+```
+
+## Rate Limiting
+
+Reactome does not currently enforce strict rate limits, but consider:
+- Implementing reasonable delays between requests
+- Using batch operations when available
+- Caching results when appropriate
+- Respecting the 7-day token validity period
+
+## Best Practices
+
+### 1. Use Analysis Tokens
+Store and reuse analysis tokens to avoid redundant computation:
+```python
+# Store token after analysis
+token = result["summary"]["token"]
+save_token(token)  # Save to file or database
+
+# Retrieve results later
+result = requests.get(f"https://reactome.org/AnalysisService/token/{token}")
+```
+
+### 2. Batch Queries
+Submit multiple identifiers in a single request rather than individual queries:
+```python
+# Good: Single batch request
+identifiers = ["TP53", "BRCA1", "EGFR"]
+result = analyze_batch(identifiers)
+
+# Avoid: Multiple individual requests
+# for gene in genes:
+#     result = analyze_single(gene)  # Don't do this
+```
+
+### 3. Handle Species Appropriately
+Use `/projection/` endpoints to map non-human identifiers to human pathways:
+```python
+# For mouse genes, project to human pathways
+response = requests.post(
+    "https://reactome.org/AnalysisService/identifiers/projection/",
+    headers={"Content-Type": "text/plain"},
+    data=mouse_genes
+)
+```
+
+### 4. Process Large Result Sets
+For analyses returning many pathways, filter by significance:
+```python
+significant_pathways = [
+    p for p in result["pathways"]
+    if p["entities"]["fdr"] < 0.05
+]
+```
+
+## Integration Examples
+
+### Complete Analysis Workflow
+```python
+import requests
+import json
+
+def analyze_gene_list(genes, output_file="analysis_results.json"):
+    """
+    Perform pathway enrichment analysis on a list of genes
+    """
+    # Submit analysis
+    data = "\n".join(genes)
+    response = requests.post(
+        "https://reactome.org/AnalysisService/identifiers/",
+        headers={"Content-Type": "text/plain"},
+        data=data
+    )
+
+    if response.status_code != 200:
+        raise Exception(f"Analysis failed: {response.text}")
+
+    result = response.json()
+    token = result["summary"]["token"]
+
+    # Filter significant pathways (FDR < 0.05)
+    significant = [
+        p for p in result["pathways"]
+        if p["entities"]["fdr"] < 0.05
+    ]
+
+    # Save results
+    with open(output_file, "w") as f:
+        json.dump({
+            "token": token,
+            "total_pathways": len(result["pathways"]),
+            "significant_pathways": len(significant),
+            "pathways": significant
+        }, f, indent=2)
+
+    # Generate browser URL for top pathway
+    if significant:
+        top_pathway = significant[0]
+        url = f"https://reactome.org/PathwayBrowser/#{top_pathway['stId']}&DTAB=AN&ANALYSIS={token}"
+        print(f"View top result: {url}")
+
+    return result
+
+# Usage
+genes = ["TP53", "BRCA1", "BRCA2", "CDK1", "CDK2"]
+result = analyze_gene_list(genes)
+```
+
+## Additional Resources
+
+- **Interactive API Documentation**: https://reactome.org/dev/content-service
+- **Analysis Service Docs**: https://reactome.org/dev/analysis
+- **User Guide**: https://reactome.org/userguide
+- **Data Downloads**: https://reactome.org/download-data
--- a/scientific-skills/reactome-database/reactome-database/scripts/reactome_query.py
+++ b/scientific-skills/reactome-database/reactome-database/scripts/reactome_query.py
@@ -0,0 +1,286 @@
+#!/usr/bin/env python3
+"""
+Reactome Database Query Helper Script
+
+This script provides convenient command-line access to common Reactome operations.
+
+Usage:
+    python reactome_query.py version
+    python reactome_query.py query <pathway_id>
+    python reactome_query.py analyze <gene_list_file>
+    python reactome_query.py search <term>
+    python reactome_query.py entities <pathway_id>
+
+Examples:
+    python reactome_query.py version
+    python reactome_query.py query R-HSA-69278
+    python reactome_query.py analyze genes.txt
+    python reactome_query.py search "cell cycle"
+    python reactome_query.py entities R-HSA-69278
+"""
+
+import sys
+import json
+import requests
+from typing import List, Dict, Optional
+
+
+class ReactomeClient:
+    """Client for interacting with Reactome REST APIs"""
+
+    CONTENT_BASE = "https://reactome.org/ContentService"
+    ANALYSIS_BASE = "https://reactome.org/AnalysisService"
+
+    def get_version(self) -> str:
+        """Get Reactome database version"""
+        response = requests.get(f"{self.CONTENT_BASE}/data/database/version")
+        response.raise_for_status()
+        return response.text.strip()
+
+    def query_pathway(self, pathway_id: str) -> Dict:
+        """Query pathway information by ID"""
+        response = requests.get(f"{self.CONTENT_BASE}/data/query/{pathway_id}")
+        response.raise_for_status()
+        return response.json()
+
+    def get_pathway_entities(self, pathway_id: str) -> List[Dict]:
+        """Get participating entities in a pathway"""
+        response = requests.get(
+            f"{self.CONTENT_BASE}/data/event/{pathway_id}/participatingPhysicalEntities"
+        )
+        response.raise_for_status()
+        return response.json()
+
+    def search_pathways(self, term: str) -> List[Dict]:
+        """Search for pathways by name"""
+        response = requests.get(
+            f"{self.CONTENT_BASE}/data/query",
+            params={"name": term}
+        )
+        response.raise_for_status()
+        return response.json()
+
+    def analyze_genes(self, gene_list: List[str]) -> Dict:
+        """Perform pathway enrichment analysis on gene list"""
+        data = "\n".join(gene_list)
+        response = requests.post(
+            f"{self.ANALYSIS_BASE}/identifiers/",
+            headers={"Content-Type": "text/plain"},
+            data=data
+        )
+        response.raise_for_status()
+        return response.json()
+
+    def get_analysis_by_token(self, token: str) -> Dict:
+        """Retrieve analysis results by token"""
+        response = requests.get(f"{self.ANALYSIS_BASE}/token/{token}")
+        response.raise_for_status()
+        return response.json()
+
+
+def print_json(data):
+    """Pretty print JSON data"""
+    print(json.dumps(data, indent=2))
+
+
+def command_version():
+    """Get and display Reactome version"""
+    client = ReactomeClient()
+    version = client.get_version()
+    print(f"Reactome Database Version: {version}")
+
+
+def command_query(pathway_id: str):
+    """Query and display pathway information"""
+    client = ReactomeClient()
+    try:
+        pathway = client.query_pathway(pathway_id)
+        print(f"Pathway: {pathway['displayName']}")
+        print(f"ID: {pathway['stId']}")
+        print(f"Type: {pathway['schemaClass']}")
+
+        if 'species' in pathway and pathway['species']:
+            species = pathway['species'][0]['displayName']
+            print(f"Species: {species}")
+
+        if 'summation' in pathway and pathway['summation']:
+            summation = pathway['summation'][0]['text']
+            print(f"\nDescription: {summation}")
+
+        print("\nFull JSON response:")
+        print_json(pathway)
+
+    except requests.HTTPError as e:
+        if e.response.status_code == 404:
+            print(f"Error: Pathway '{pathway_id}' not found")
+        else:
+            print(f"Error: {e}")
+        sys.exit(1)
+
+
+def command_entities(pathway_id: str):
+    """Display entities participating in a pathway"""
+    client = ReactomeClient()
+    try:
+        entities = client.get_pathway_entities(pathway_id)
+        print(f"Entities in pathway {pathway_id}: {len(entities)} total\n")
+
+        # Group by type
+        by_type = {}
+        for entity in entities:
+            entity_type = entity['schemaClass']
+            if entity_type not in by_type:
+                by_type[entity_type] = []
+            by_type[entity_type].append(entity)
+
+        # Display by type
+        for entity_type, entities_list in sorted(by_type.items()):
+            print(f"{entity_type} ({len(entities_list)}):")
+            for entity in entities_list[:10]:  # Show first 10
+                print(f"  - {entity['stId']}: {entity['displayName']}")
+            if len(entities_list) > 10:
+                print(f"  ... and {len(entities_list) - 10} more")
+            print()
+
+    except requests.HTTPError as e:
+        if e.response.status_code == 404:
+            print(f"Error: Pathway '{pathway_id}' not found")
+        else:
+            print(f"Error: {e}")
+        sys.exit(1)
+
+
+def command_search(term: str):
+    """Search for pathways by term"""
+    client = ReactomeClient()
+    try:
+        results = client.search_pathways(term)
+        print(f"Search results for '{term}': {len(results)} found\n")
+
+        for result in results[:20]:  # Show first 20
+            print(f"{result['stId']}: {result['displayName']}")
+            if 'species' in result and result['species']:
+                species = result['species'][0]['displayName']
+                print(f"  Species: {species}")
+            print(f"  Type: {result['schemaClass']}")
+            print()
+
+        if len(results) > 20:
+            print(f"... and {len(results) - 20} more results")
+
+    except requests.HTTPError as e:
+        print(f"Error: {e}")
+        sys.exit(1)
+
+
+def command_analyze(gene_file: str):
+    """Perform pathway enrichment analysis"""
+    client = ReactomeClient()
+
+    # Read gene list
+    try:
+        with open(gene_file, 'r') as f:
+            genes = [line.strip() for line in f if line.strip()]
+    except FileNotFoundError:
+        print(f"Error: File '{gene_file}' not found")
+        sys.exit(1)
+
+    print(f"Analyzing {len(genes)} genes...")
+
+    try:
+        result = client.analyze_genes(genes)
+
+        # Display summary
+        summary = result['summary']
+        print(f"\nAnalysis Type: {summary['type']}")
+        print(f"Token: {summary['token']} (valid for 7 days)")
+        print(f"Species: {summary.get('species', 'N/A')}")
+
+        # Display pathways
+        pathways = result.get('pathways', [])
+        print(f"\nEnriched Pathways: {len(pathways)} found")
+
+        # Show significant pathways (FDR < 0.05)
+        significant = [p for p in pathways if p['entities']['fdr'] < 0.05]
+        print(f"Significant (FDR < 0.05): {len(significant)}\n")
+
+        # Display top 10 pathways
+        print("Top 10 Pathways:")
+        for i, pathway in enumerate(pathways[:10], 1):
+            print(f"\n{i}. {pathway['name']}")
+            print(f"   ID: {pathway['stId']}")
+            entities = pathway['entities']
+            print(f"   Found: {entities['found']}/{entities['total']} entities")
+            print(f"   p-value: {entities['pValue']:.6e}")
+            print(f"   FDR: {entities['fdr']:.6e}")
+
+        # Generate browser URL for top pathway
+        if pathways:
+            token = summary['token']
+            top_pathway = pathways[0]['stId']
+            url = f"https://reactome.org/PathwayBrowser/#{top_pathway}&DTAB=AN&ANALYSIS={token}"
+            print(f"\nView top result in browser:")
+            print(url)
+
+        # Save full results
+        output_file = gene_file.replace('.txt', '_results.json')
+        with open(output_file, 'w') as f:
+            json.dump(result, f, indent=2)
+        print(f"\nFull results saved to: {output_file}")
+
+    except requests.HTTPError as e:
+        print(f"Error: {e}")
+        sys.exit(1)
+
+
+def print_usage():
+    """Print usage information"""
+    print(__doc__)
+
+
+def main():
+    if len(sys.argv) < 2:
+        print_usage()
+        sys.exit(1)
+
+    command = sys.argv[1].lower()
+
+    if command == "version":
+        command_version()
+
+    elif command == "query":
+        if len(sys.argv) < 3:
+            print("Error: pathway_id required")
+            print("Usage: python reactome_query.py query <pathway_id>")
+            sys.exit(1)
+        command_query(sys.argv[2])
+
+    elif command == "entities":
+        if len(sys.argv) < 3:
+            print("Error: pathway_id required")
+            print("Usage: python reactome_query.py entities <pathway_id>")
+            sys.exit(1)
+        command_entities(sys.argv[2])
+
+    elif command == "search":
+        if len(sys.argv) < 3:
+            print("Error: search term required")
+            print("Usage: python reactome_query.py search <term>")
+            sys.exit(1)
+        command_search(" ".join(sys.argv[2:]))
+
+    elif command == "analyze":
+        if len(sys.argv) < 3:
+            print("Error: gene list file required")
+            print("Usage: python reactome_query.py analyze <gene_list_file>")
+            sys.exit(1)
+        command_analyze(sys.argv[2])
+
+    else:
+        print(f"Error: Unknown command '{command}'")
+        print_usage()
+        sys.exit(1)
+
+
+if __name__ == "__main__":
+    main()