Automating LLM as a judge with EvalForge and Weave | Seattle .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

September 26, 2024 · Seattle

EvalForge: Automating LLM Judge

This talk explores automating custom LLM evaluation criteria using EvalForge and Weave, enabling users to create and run bespoke, human-aligned assessments.

Overview
Links
Tech stack