TL;DR
Exopriors is publishing a Claude Code prompt and public key that let Claude query a large, growing index of research and community writing relevant to the intelligence-explosion topic. The setup exposes SQL-style and vector-algebra search primitives over hundreds of gigabytes of indexed content; full features and private vectors require an account.
What happened
Exopriors published a ready-to-use prompt and public key intended for use with Claude Code that grants the model direct access to a large document index covering sources such as arXiv, Hacker News, LessWrong and community-archive.org. The service exposes both traditional lexical search (BM25) and vector-based semantic search, plus SQL-like queries and vector algebra operations (for example, cosine-distance calculations, vector mixing, centroids and temporal deltas). The site provides instructions for using the prompt inside the Claude web app — including enabling code execution, file creation and network egress and adding api.exopriors.com to a domain allowlist — and offers an example SQL/vector query for tracing discussion of mesa-optimization since 2023. Exopriors lists the index size and scale (65M+ documents, 22M+ embeddings, 600GB+ indexes) and says basic access is free for researchers with 1.5M embedding tokens included; creating an account unlocks higher limits and private vectors. The project also highlights security tradeoffs and possible prompt-injection risks when allowing models to call external APIs.
Why it matters
- Provides researchers programmatic access to a large, curated corpus combining preprints and community discussion, enabling cross-source queries.
- Combining SQL-like filters with vector algebra supports nuanced semantic searches and hybrid lexical/semantic ranking strategies.
- Embedding storage and server-side vectors reduce context limits in the client, enabling larger-scale, persistent search workflows.
- Granting API/network permissions to a model raises operational security and prompt-injection considerations that users must weigh.
Key facts
- Exopriors supplies a Claude Code prompt plus a public API key to query their index.
- Indexed sources named on the site include arXiv, Hacker News, LessWrong and community-archive.org.
- Search primitives include BM25 lexical search, semantic embeddings, hybrid re-ranking and SQL-like querying over embeddings.
- Vector operations demonstrated include cosine-distance (<=>), stored vectors, vector mixing, centroids and temporal deltas.
- The site reports 65M+ documents, 22M+ embeddings and 600GB+ of indexed data.
- Free researcher tier includes 1.5M embedding tokens; an account is required for higher limits and private vectors.
- Instructions are provided for Claude web: enable code execution and file creation, allow network egress, and add api.exopriors.com to the domain allowlist.
- Exopriors recommends some convenience flags for Claude that skip permissions but explicitly warns about the associated risks and prompt-injection exposure.
What to watch next
- not confirmed in the source: whether the index will expand to include additional or proprietary data sources beyond those listed.
- not confirmed in the source: how the platform will handle access controls, billing tiers, and long-term storage of private vectors.
- not confirmed in the source: community adoption rates, third-party integrations, or formal security audits.
Quick glossary
- Vector embedding: A numeric representation of text (or other data) in high-dimensional space used to measure semantic similarity.
- Cosine similarity: A metric that measures the cosine of the angle between two vectors; commonly used to compare embedding similarity.
- BM25: A probabilistic bag-of-words ranking function used for lexical search and relevance scoring in information retrieval.
- Hybrid search: A search approach that combines lexical methods (like BM25) with semantic re-ranking using embeddings.
- Prompt injection: A class of attacks where adversarial inputs manipulate a language model or its agents to perform unintended actions or disclose information.
Reader FAQ
How can I start querying the index with Claude?
Paste the provided prompt and public key into Claude Code or the Claude web app; for the web app the site instructs enabling code execution, file creation and network egress and adding api.exopriors.com to the domain allowlist.
Is there a free tier?
Yes — the site states there is a free researcher tier that includes 1.5M embedding tokens.
Do I need an account for private vectors and higher limits?
The site says creating an account is required to get full functionality, higher limits and private vectors.
Should I use the suggested '–dangerously-skip-permissions' flag?
The site recommends that flag for ergonomics but also warns about the security risks and potential for prompt-injection; users must accept that risk themselves.
Which data sources are included in the index?
Listed sources include arXiv, Hacker News, LessWrong and community-archive.org; additional sources are described as 'etc.'
Ask unprecedentedly nuanced questions. We give you and Claude full ARBITRARY SQL + VECTOR ALGEBRA search power over a growing index of documents relevant to the intelligence explosion. ◇ arXiv…
Sources
- Show HN: Use Claude Code to Query 600 GB Indexes over Hacker News, ArXiv, etc.
- Show | Hacker News
- Announcing Scry: A research tool over arXiv, EA Forum, etc …
- Claude Code is all you need
Related posts
- We don’t need more contributors who aren’t programmers to contribute code
- Professional Software Developers Don’t Vibe, They Control: AI Agent Use in 2025
- Professional Developers Direct, Not ‘Vibe’: AI Agent Coding in 2025