TL;DR

Exopriors is publishing a Claude Code prompt and public key that let Claude query a large, growing index of research and community writing relevant to the intelligence-explosion topic. The setup exposes SQL-style and vector-algebra search primitives over hundreds of gigabytes of indexed content; full features and private vectors require an account.

What happened

Exopriors published a ready-to-use prompt and public key intended for use with Claude Code that grants the model direct access to a large document index covering sources such as arXiv, Hacker News, LessWrong and community-archive.org. The service exposes both traditional lexical search (BM25) and vector-based semantic search, plus SQL-like queries and vector algebra operations (for example, cosine-distance calculations, vector mixing, centroids and temporal deltas). The site provides instructions for using the prompt inside the Claude web app — including enabling code execution, file creation and network egress and adding api.exopriors.com to a domain allowlist — and offers an example SQL/vector query for tracing discussion of mesa-optimization since 2023. Exopriors lists the index size and scale (65M+ documents, 22M+ embeddings, 600GB+ indexes) and says basic access is free for researchers with 1.5M embedding tokens included; creating an account unlocks higher limits and private vectors. The project also highlights security tradeoffs and possible prompt-injection risks when allowing models to call external APIs.

Why it matters

  • Provides researchers programmatic access to a large, curated corpus combining preprints and community discussion, enabling cross-source queries.
  • Combining SQL-like filters with vector algebra supports nuanced semantic searches and hybrid lexical/semantic ranking strategies.
  • Embedding storage and server-side vectors reduce context limits in the client, enabling larger-scale, persistent search workflows.
  • Granting API/network permissions to a model raises operational security and prompt-injection considerations that users must weigh.

Key facts

  • Exopriors supplies a Claude Code prompt plus a public API key to query their index.
  • Indexed sources named on the site include arXiv, Hacker News, LessWrong and community-archive.org.
  • Search primitives include BM25 lexical search, semantic embeddings, hybrid re-ranking and SQL-like querying over embeddings.
  • Vector operations demonstrated include cosine-distance (<=>), stored vectors, vector mixing, centroids and temporal deltas.
  • The site reports 65M+ documents, 22M+ embeddings and 600GB+ of indexed data.
  • Free researcher tier includes 1.5M embedding tokens; an account is required for higher limits and private vectors.
  • Instructions are provided for Claude web: enable code execution and file creation, allow network egress, and add api.exopriors.com to the domain allowlist.
  • Exopriors recommends some convenience flags for Claude that skip permissions but explicitly warns about the associated risks and prompt-injection exposure.

What to watch next

  • not confirmed in the source: whether the index will expand to include additional or proprietary data sources beyond those listed.
  • not confirmed in the source: how the platform will handle access controls, billing tiers, and long-term storage of private vectors.
  • not confirmed in the source: community adoption rates, third-party integrations, or formal security audits.

Quick glossary

  • Vector embedding: A numeric representation of text (or other data) in high-dimensional space used to measure semantic similarity.
  • Cosine similarity: A metric that measures the cosine of the angle between two vectors; commonly used to compare embedding similarity.
  • BM25: A probabilistic bag-of-words ranking function used for lexical search and relevance scoring in information retrieval.
  • Hybrid search: A search approach that combines lexical methods (like BM25) with semantic re-ranking using embeddings.
  • Prompt injection: A class of attacks where adversarial inputs manipulate a language model or its agents to perform unintended actions or disclose information.

Reader FAQ

How can I start querying the index with Claude?
Paste the provided prompt and public key into Claude Code or the Claude web app; for the web app the site instructs enabling code execution, file creation and network egress and adding api.exopriors.com to the domain allowlist.

Is there a free tier?
Yes — the site states there is a free researcher tier that includes 1.5M embedding tokens.

Do I need an account for private vectors and higher limits?
The site says creating an account is required to get full functionality, higher limits and private vectors.

Should I use the suggested '–dangerously-skip-permissions' flag?
The site recommends that flag for ergonomics but also warns about the security risks and potential for prompt-injection; users must accept that risk themselves.

Which data sources are included in the index?
Listed sources include arXiv, Hacker News, LessWrong and community-archive.org; additional sources are described as 'etc.'

Ask unprecedentedly nuanced questions. We give you and Claude full ARBITRARY SQL + VECTOR ALGEBRA search power over a growing index of documents relevant to the intelligence explosion. ◇ arXiv…

Sources

Related posts

By

Leave a Reply

Your email address will not be published. Required fields are marked *