Creating a Python Evaluator - Blog

Dhruv Singh

Updated: Mar 18, 2024

Description

Click through a step-by-step, interactive demo walkthrough of Honeyhive, powered by Supademo.

Steps

Select which event type you want to run your evaluator on. Click on "Model".

Let's setup an evaluator for the LLM step. Click on "Model".

Now, let's specific the model event to run your evaluator on. Click on "All Completions".

Select "Model Completion" as the event.

Next, click on "Show Schema" to review what fields are available.

Now, click on "content" under `outputs` to add that field to your evaluator.

Following that, we can paste the reference to that field in the "Console" code.

Next up, we'll add a simple `len(model_response)` to get the length of response

Click on "Get datapoints" to test your new evaluator against production logs.

Video step

After that, click on "Run all".

Let's add a simple passing range for what the minimum & maximum length should be.

Hit the toggle under "Enable In Production" to enable it in production

Click on "Advanced Settings".

Set a sampling percentage based on how expensive the evaluator might be to run.

Click on "Create" to create your evaluator!

Your evaluator is now enabled over production logs!

Enjoyed the guided demo?