Pedro Rodrigues 9b08864e94 feat(evals): replace mock CLIs with real Supabase instance per eval run
Start a shared local Supabase stack once before all scenarios and reset
the database (drop/recreate public schema + clear migration history) between
each run. This lets agents apply migrations via `supabase db push` against a
real Postgres instance instead of mock shell scripts.

- Add supabase-setup.ts: startSupabase / stopSupabase / resetDB / getKeys
- Update runner.ts to start/stop Supabase and inject keys into process.env
- Update agent.ts to point MCP config at the local Supabase HTTP endpoint
- Update preflight.ts to check supabase CLI availability and Docker socket
- Update scaffold.ts to seed workspace with supabase/config.toml
- Add passThreshold support (test.ts / results.ts / types.ts) for partial pass
- Delete mock shell scripts (mocks/docker, mocks/psql, mocks/supabase)
- Update Dockerfile/docker-compose to mount Docker socket for supabase CLI

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-25 14:39:54 +00:00

Supabase Agent Skills

Agent Skills to help developers using AI agents with Supabase. Agent Skills are folders of instructions, scripts, and resources that agents such as Claude Code, Cursor, and GitHub Copilot can discover and use to work more accurately and efficiently.

The skills in this repo follow the Agent Skills format.

Installation

npx skills add supabase/agent-skills

Claude Code Plugin

You can also install the skills in this repo as Claude Code plugins:

/plugin marketplace add supabase/agent-skills
/plugin install postgres-best-practices@supabase-agent-skills

Available Skills

supabase-postgres-best-practices

Postgres performance optimization guidelines from Supabase. Contains references across 8 categories, prioritized by impact.

Use when:

  • Writing SQL queries or designing schemas
  • Implementing indexes or query optimization
  • Reviewing database performance issues
  • Configuring connection pooling or scaling
  • Working with Row-Level Security (RLS)

Categories covered:

  • Query Performance (Critical)
  • Connection Management (Critical)
  • Schema Design (High)
  • Concurrency & Locking (Medium-High)
  • Security & RLS (Medium-High)
  • Data Access Patterns (Medium)
  • Monitoring & Diagnostics (Low-Medium)
  • Advanced Features (Low)

Usage

Skills are automatically available once installed. The agent uses them when it detects a relevant task.

Examples:

Optimize this Postgres query
Review my schema for performance issues
Help me add proper indexes to this table

Skill Structure

Each skill follows the Agent Skills Open Standard:

  • SKILL.md - Required skill manifest with frontmatter (name, description, metadata)
  • AGENTS.md - Compiled references document (generated)
  • references/ - Individual reference files
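
As an illustration of the manifest described above, here is a minimal sketch of reading the frontmatter fields from the text of a SKILL.md file. It assumes the frontmatter is a block of simple `key: value` lines delimited by `---` at the top of the file. `parseFrontmatter` and `missingRequiredFields` are hypothetical helpers written for this example, not part of this repo or of any Agent Skills tooling, and the parser is deliberately not a full YAML implementation.

```typescript
// Hypothetical helpers for illustration only; not part of this repository.
type Frontmatter = Record<string, string>;

// Extract top-level `key: value` pairs from a frontmatter block.
// Assumption: the block opens with `---` on the first line and closes with
// another `---` line. Nested or multi-line YAML values are not handled.
function parseFrontmatter(text: string): Frontmatter {
  const lines = text.split(/\r?\n/);
  if (lines[0].trim() !== "---") {
    throw new Error("missing opening --- delimiter");
  }
  const end = lines.findIndex((line, i) => i > 0 && line.trim() === "---");
  if (end === -1) {
    throw new Error("missing closing --- delimiter");
  }

  const fields: Frontmatter = {};
  for (const line of lines.slice(1, end)) {
    const sep = line.indexOf(":");
    // Skip blank lines, comments, and indented (nested) lines.
    if (sep <= 0 || /^\s/.test(line) || line.startsWith("#")) continue;
    fields[line.slice(0, sep).trim()] = line.slice(sep + 1).trim();
  }
  return fields;
}

// Report which of the fields named in the list above are absent or empty.
// `metadata` is left out because this sketch does not parse nested values.
function missingRequiredFields(fm: Frontmatter): string[] {
  return ["name", "description"].filter((key) => !fm[key]);
}

// Example input; the skill name and description here are made up.
const example = [
  "---",
  "name: example-skill",
  "description: An example skill used to demonstrate parsing",
  "---",
  "# Example Skill",
].join("\n");

const parsed = parseFrontmatter(example);
console.log(parsed.name, missingRequiredFields(parsed));
```

A real validator would more likely use a YAML library, so that quoted strings and the nested `metadata` block are parsed correctly.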