Developer on the trulens team here. We just put out a pretty cool new release (0.1.2). Integration with huggingface pipelines in langchain and asynchronous feedback management are just some of the useful updates in this release.
Real evaluations (maybe evaluated by LLMs themselves) instead of just vibe checks is the next big step we need to take as an industry.
Learn more about TruLens here if you're curious: https://medium.com/trulens/evaluate-and-track-your-llm-exper...
Docs and other resources at: https://www.trulens.org/
Developer on the trulens team here. We just put out a pretty cool new release (0.1.2). Integration with huggingface pipelines in langchain and asynchronous feedback management are just some of the useful updates in this release.
Real evaluations (maybe evaluated by LLMs themselves) instead of just vibe checks is the next big step we need to take as an industry.
It's essential to assess language models based on their performance and capabilities rather than subjective measures like "vibes."
Any acceleration in the evaluation process is welcome and much, much needed. Looking forward to checking out TruLens.
Thanks! Would love to get any feedback.
Stop deploying LLMs to your users based on vibes. Use TruLens to evaluate your LLM apps
You guys are on fire