
While Tokasaurus’s Async-TP shows impressive throughput gains, it seems over-engineered for common use cases. The CPU overhead from async tensor parallelism only pays off at 6k+ token batches, and you need NVLink-connected GPUs to see real benefits. Most prod deployments don’t need this complexity — you’re better off with simpler approaches unless you’re specifically optimizing for massive batch throughput. The adaptive manager skipping “optional” tasks under load also feels concerning from a reliability perspective.


Depends on what production means for you. This is useful for batch production jobs.

Also, this seems very useful for generating synthetic data or labelling a bunch of data. A 6k batch size is small for data labelling; a rough sketch of what such a job looks like is below.
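
For a sense of scale, here's a minimal sketch of a batch-labelling job against an OpenAI-compatible endpoint. The URL, model name, prompt, and worker count are placeholders, not anything Tokasaurus-specific:

    # Minimal batch-labelling sketch against an OpenAI-compatible server.
    # Endpoint, model name, and prompt are placeholders, not Tokasaurus-specific.
    from concurrent.futures import ThreadPoolExecutor
    from openai import OpenAI

    client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")
    LABELS = ["positive", "negative", "neutral"]

    def label(text: str) -> str:
        resp = client.chat.completions.create(
            model="my-model",  # placeholder model name
            messages=[
                {"role": "system",
                 "content": f"Classify the sentiment as one of {LABELS}. "
                            "Reply with the label only."},
                {"role": "user", "content": text},
            ],
            temperature=0.0,
            max_tokens=5,
        )
        return resp.choices[0].message.content.strip().lower()

    texts = [f"example document {i}" for i in range(6000)]  # 6k items is a modest batch here

    # Fire requests concurrently and let the server batch them internally.
    with ThreadPoolExecutor(max_workers=64) as pool:
        labels = list(pool.map(label, texts))

At that point the bottleneck is entirely server-side throughput, which is exactly what this kind of engine is optimizing for.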


How big of a use case is synthetic data generation? I'm curious, as I see a lot about it coming from academic projects, but I haven't seen much related to commercial use cases.


Tiny NNs distilled from LLMs can produce some amazing results; I'm surprised it's not more common, tbh.
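
For what it's worth, a rough sketch of the distillation idea: have an LLM label a pile of text (as in the snippet upthread), then train a tiny student on those labels. The encoder and classifier choices here are illustrative, not anyone's specific recipe:

    # Sketch: distill LLM-produced labels into a tiny student model.
    # Assumes `texts` and `labels` already exist from an LLM labelling pass.
    from sentence_transformers import SentenceTransformer
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    encoder = SentenceTransformer("all-MiniLM-L6-v2")  # small, fast sentence encoder
    X = encoder.encode(texts)                          # (n, 384) float embeddings

    X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.2, random_state=0)

    student = LogisticRegression(max_iter=1000)
    student.fit(X_tr, y_tr)
    print("held-out agreement with LLM labels:", student.score(X_te, y_te))

The student is orders of magnitude cheaper to serve than the teacher, which is the whole appeal.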


I agree, there are impressive results. This just came out of Berkeley: https://arxiv.org/abs/2506.04178

But still, I mainly see work in this direction coming from academia.


But surely next year's production deployments will be very different from today's, with different use cases, etc.


Sure. Things change over time. Is there a reason to believe they'd be different in such a way that this would be more useful than in today's landscape? I haven't seen such a forecast myself.



