The San Francisco-based startup has developed a novel LLM router, which allows enterprises to have multiple models in play and direct queries to the best one, improving not only the quality of outputs but also other usage-critical aspects such as overall latency and associated costs. “Our fundamental bet is that the future won’t have one single, giant model or company that everyone sends everything to—instead, there will be many foundation models, millions of fine-tuned variants of those models, and countless custom inference engines running on top of them. We started Not Diamond to enable this multi-model future, starting with the world’s most powerful infrastructure for routing between models,” Tomás Hernando Kofman, the CEO and co-founder of Not Diamond, said in a statement.
No comments:
Post a Comment