8 Comments
User's avatar
Daniel Olshansky's avatar

Such a great post. So many thoughts!

1. This post distills years of intuition around the role of data in a short read.

2. Curious if “golden data sets” are still better better than “large data”. If you can’t have quality and quantity, is small quantity enough?

3. Personally, I think agentic stablecoin payments are going to serve as a strong signal (label?) for high quality content in the future. The de-slopifier.

4. It’s early, but pre-construction is where I believe the next large untapped data opportunity will lie.

Pascal Walter's avatar

Is coming french quality 🤣😊

MadeAi's avatar

Loved this take, feels very relevant right now. The way you’ve framed it makes the whole topic feel a lot more practical than theoretical.

Martin Alen's avatar

This is excellent - thanks.

If models and compute have labs, and data now needs one too, there’s a deeper layer worth naming: how human signal becomes data in the first place.

Most datasets are built from behavioural exhaust, scraped artifacts, and inferred intent. By the time data reaches a “lab,” it’s already distorted by weak grounding, misaligned incentives, and performative outputs.

The bottleneck may not just be data quality. It may be upstream signal integrity.

AI labs will need structured, permissioned, constraint-aware human input not just better curation of noisy traces. There’s an opportunity to design infrastructure for declared intent before it collapses into downstream noise.

If you’re thinking about a data lab, it’s worth asking: what would a protocol for high-fidelity human-origin signal look like and who builds that layer?

Might be us - https://s7tlabs.com/

NFT News's avatar

An AI Lab for data would be a dream for government agencies and big corporations. Fronteir AI would be perfect for setting up and running a AI Lab for Data centers.

Frontier AI: https://promptengineer-1.weebly.com/frontier-ai.html

Opinion AI's avatar

Strong point. We talk a lot about models and chips, but bad data still breaks everything.

If the data layer stays weak, the smartest model will still fail in real world work

8Lee's avatar

“The world needs… this startup we’ve invested in!”

But seriously. We do need more of these.

Isaac Sarfo's avatar

Exactly what my team and I are building at Papermap.ai, great read!