A Generative AI Engineer is tasked with developing a RAG application to help a small internal group of experts at their company answer specific questions, augmented by an internal knowledge base. They want the best possible answer quality, and neither latency nor throughput is a
major concern, given that the user group is small and willing to wait for the best answer. The topics are sensitive in nature and the data is highly confidential, so, due to regulatory requirements, none of the information may be transmitted to third parties. Which model meets all of the Generative AI Engineer's needs in this situation?
DBRX Instruct nails all the requirements: you can deploy it entirely on your own Databricks setup, so no confidential info leaves your infrastructure. It's also top-tier for open-weight LLMs quality-wise, which matters here since latency and throughput aren't a priority. I think it's the most compliant choice for high-sensitivity cases like this, unless someone knows a better private LLM option?
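The elimination logic behind this answer can be sketched as a simple filter: first drop anything that would send data to a third party, then pick the best remaining model on quality alone. Note the model list, the `self_hostable` flags, and the quality ranks below are illustrative assumptions for this scenario, not benchmark results:

```python
# Hypothetical sketch of the selection criteria in this scenario.
# All names, flags, and quality ranks are illustrative assumptions.

CANDIDATES = [
    # (model name, self-hostable on own infra?, assumed quality rank; lower is better)
    ("DBRX Instruct", True, 1),
    ("Llama2-70B", True, 2),
    ("Hosted proprietary API model", False, 1),  # fails the no-third-party rule
]

def pick_model(candidates):
    """Keep only models that never transmit data to a third party,
    then take the highest-quality survivor (latency/throughput ignored)."""
    private = [c for c in candidates if c[1]]
    return min(private, key=lambda c: c[2])[0]

print(pick_model(CANDIDATES))  # -> DBRX Instruct under these assumed rankings
```

The key point the filter encodes: the compliance constraint is a hard filter, while quality is only an ordering applied afterward, so a higher-quality hosted model can never win.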
If we leave out latency and cost, and the model must stay fully in-house for compliance, does Llama2-70B really match up to DBRX Instruct for answer quality in a Databricks-centric environment?
Is DBRX Instruct actually the best choice if the org cares only about max quality and full on-prem for compliance? Had something like this in a mock and they wanted an open-weight model that could be air-gapped. Wouldn't Llama2-70B also qualify if set up correctly, or does DBRX currently beat it in answer quality?