Data pipeline
Rebuild the corpus straight from the web: pull Indian startups from Wikipedia and Y Combinator, embed each one, and load the results into Postgres. No terminal required.
~110 companies · ~1–2 min
Run the pipeline to (re)populate the corpus from live sources.
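
For orientation, the fetch, embed, and load stages described above might be wired together roughly as follows. This is a minimal sketch, not the app's actual code: `Company`, `dedupe`, `toy_embed`, and `run_pipeline` are hypothetical names, the embedding is a hash-based placeholder rather than a real model, and the store is any callable that accepts rows (in practice a Postgres insert, which is not shown here).

```python
import hashlib
from dataclasses import dataclass
from typing import Callable, Iterable, Sequence


@dataclass(frozen=True)
class Company:
    name: str
    source: str        # e.g. "wikipedia" or "yc" (labels assumed for illustration)
    description: str


def dedupe(companies: Iterable[Company]) -> list[Company]:
    """Keep the first record seen for each case-insensitive, trimmed name."""
    seen: set[str] = set()
    unique: list[Company] = []
    for company in companies:
        key = company.name.strip().lower()
        if key not in seen:
            seen.add(key)
            unique.append(company)
    return unique


def toy_embed(text: str, dim: int = 8) -> list[float]:
    """Placeholder embedding: SHA-256 bytes scaled to [0, 1]. Not a real model."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [byte / 255 for byte in digest[:dim]]


def run_pipeline(
    fetchers: Sequence[Callable[[], Iterable[Company]]],
    embed: Callable[[str], list[float]],
    store: Callable[[list[tuple[str, str, list[float]]]], object],
) -> int:
    """Fetch from every source, dedupe, embed each description, hand rows to the store."""
    companies = dedupe(c for fetch in fetchers for c in fetch())
    rows = [(c.name, c.source, embed(c.description)) for c in companies]
    store(rows)
    return len(rows)
```

Passing the stages in as callables keeps the sketch runnable offline: stub fetchers and a plain list's `extend` method can stand in for the live sources and the database during testing.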