Vivek Sriram
Chief Product Officer @ Bookend AI
The rush to AI-enable everything is understandable. No one wants to be the last business to figure out the obvious. Yet, this rapid mass embrace of immature, brittle tools which are frequently not ready for primetime is causing no shortage of heartburn for those in enterprise Information Technology. Three anecdotes serve to underscore the severity of the problem.
- Security / inadvertent exposure of private data. Despite some warnings to not put confidential / private information into ChatGPT, people frequently take confidential data and stick it into ChatGPT. What could go wrong? Well, ChatGPT might leak some of that data due to some services it in turn uses. No doubt OpenAI is a well run, professional organization, with a quick response, but what about all the other Open AI clones there?
- Operations / observability. The current stacks in wide use now aren’t really all that well suited for a new LLM-powered everything world. While there are plenty of monitoring and observability tools out there, the key consideration is in addressing the nuances specific to LLM-powered apps. That as of now is almost non-existent.
- Cost and performance. GPUs are expensive, and sometimes scarce. Training LLMs is cumbersome, complicated and costly. Per Clement Delangue, the CEO Hugging Face: the process of training the company’s Bloom large language model took more than two-and-a-half months and required access to a supercomputer that was “something like the equivalent of 500 GPUs.”
Recent Insights
Experience Intelligence – Investigation
Speakers Description In this webinar we discuss MC+A’s new solution and approach for Intelligence Experiences for investigation use cases using LLM technology and machine learning. We demo how the solutions can act as a catalyst in expediting investigative processes. LLM technology assist with investigation due to its ability to understand, interpret, and analyze vast swathes of data, thereby aiding in
Comparing Performance of OpenAI GPT-4 and Microsoft Azure GPT-4
In this article, we’ll compare the performance of OpenAI’s API versus Microsoft’s API when utilizing GPT-4.
E-commerce Relevancy Improving B2B with Vectors
Join our panelists for a webinar where they discuss approaches for improving relevance for e-commerce search. They will cover ELAND and ELSER, promising to reshape the relevance landscape with vector-based search. Don’t miss out on an interesting discussion that could change your approach to e-commerce search relevancy.
Go Further with Expert Consulting
Launch your technology project with confidence. Our experts allow you to focus on your project’s business value by accelerating the technical implementation with a best practice approach. We provide the expert guidance needed to enhance your users’ search experience, push past technology roadblocks, and leverage the full business potential of search technology.