Audit evaluator scores
LLM-as-a-judge evaluators don't always get it right. Because of this, it is often useful for a human to manually audit the scores left by an evaluator and correct them where necessary. LangSmith allows you to make corrections on evaluator scores in the UI or SDK.
In the comparison view
In the comparison view, you may click on any feedback tag to bring up the feedback details. From there, click the "edit" icon on the right to bring up the corrections view.
In the runs table
In the runs table, find the "Feedback" column and click on the feedback tag to bring up the feedback details. Again, click the "edit" icon on the right to bring up the corrections view.
In the SDK
Corrections can be made via the SDK's update_feedback
function, with the correction
dict. You must specify a score
key which corresponds to a number for it to be rendered in the UI.
- Python
- TypeScript
import langsmith
client = langsmith.Client()
client.update_feedback(
my_feedback_id,
correction={
"score": 1,
},
)
import { Client } from 'langsmith';
const client = new Client();
await client.updateFeedback(
myFeedbackId,
{
correction: {
score: 1,
}
}
)