Data pipeline

Rebuild the corpus straight from the web: pull Indian startups from Wikipedia and Y Combinator, compute an embedding for each, and load the results into Postgres. No terminal required.

~110 companies · ~1–2 min

Run the pipeline to (re)populate the corpus from live sources.
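
For orientation, the fetch, embed, and load stages described above could be sketched roughly as follows. This is an illustration under stated assumptions, not the actual implementation: `Company`, `toy_embed`, and `run_pipeline` are hypothetical names, the hash-based embedding is a stand-in for a real embedding model, de-duplicating by name across sources is assumed rather than documented, and the `store` callback stands in for whatever writes rows to Postgres.

```python
import hashlib
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Company:
    """Hypothetical record shape; the real schema is not shown in this page."""
    name: str
    source: str        # e.g. "wikipedia" or "ycombinator"
    description: str


def toy_embed(text: str, dim: int = 8) -> List[float]:
    """Placeholder embedding: hashes text into `dim` floats in [0, 1].

    NOT a real embedding model; it only makes the sketch runnable offline.
    """
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [digest[i] / 255.0 for i in range(dim)]


def run_pipeline(
    fetchers: Iterable[Callable[[], Iterable[Company]]],
    embed: Callable[[str], List[float]],
    store: Callable[[Company, List[float]], None],
) -> int:
    """Fetch from every source, embed each company, and hand it to `store`.

    Assumption: companies appearing in more than one source are kept once,
    keyed by a case-insensitive, whitespace-trimmed name.
    Returns the number of companies stored.
    """
    seen = set()
    loaded = 0
    for fetch in fetchers:
        for company in fetch():
            key = company.name.strip().lower()
            if key in seen:
                continue  # already loaded from an earlier source
            seen.add(key)
            # In a real deployment `store` would insert the row and its
            # vector into Postgres; here it is just a callback.
            store(company, embed(company.description))
            loaded += 1
    return loaded
```

In this sketch each source is a zero-argument callable, so the live Wikipedia and Y Combinator fetchers can be swapped for in-memory lists when testing, and the Postgres writer can be replaced with a function that appends to a list.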