How to make the most out of LLM production data: simulated user feedback towardsdatascience.com Post date April 11, 2024 No Comments on How to make the most out of LLM production data: simulated user feedback External Tags GenAI, LLM, llm-evaluation, llmops, Simulation
Building a Math Application with LangChain Agents towardsdatascience.com Post date March 19, 2024 No Comments on Building a Math Application with LangChain Agents External Tags chainlit, hands-on-tutorials, langchain, langchain-agents, llm-evaluation
Top Evaluation Metrics for RAG Failures towardsdatascience.com Post date February 2, 2024 No Comments on Top Evaluation Metrics for RAG Failures External Tags evaluation-metric, large-language-models, llm-evaluation, llmops, retrieval-augmented
Calling All Functions towardsdatascience.com Post date December 7, 2023 No Comments on Calling All Functions External Tags calling-function, gpt-4, large-language-models, llm-evaluation, openai
Steady the Course: Navigating the Evaluation of LLM-based Applications towardsdatascience.com Post date November 9, 2023 No Comments on Steady the Course: Navigating the Evaluation of LLM-based Applications External Tags artificial-intelligence, LLM, LLM applications, llm-evaluation, machine-learning
LLM Evals: Setup and the Metrics That Matter towardsdatascience.com Post date October 13, 2023 No Comments on LLM Evals: Setup and the Metrics That Matter External Tags hands-on-tutorials, llm-evaluation, llmops, observability, open-ai-api