Deepmark AI - LLM benchmarking tool for task-specific metrics on your data

zalran (64)in #steemhunt • last year

Deepmark AI

LLM benchmarking tool for task-specific metrics on your data

Screenshots

Hunter's comment

Deepmark AI is a benchmarking tool that enables assessment of several large language models (LLM) on various extrinsic (task-specific) metrics (e.g. accuracy, relevance, failure rate, latency, etc) on your own data, so your AI apps have reliable performance.

Link

https://github.com/IngestAI/deepmark

This is posted on Steemhunt - A place where you can dig products and earn STEEM.
View on Steemhunt.com

last year in #steemhunt by zalran (64)

Sort:

steemhunt (76) last year

Congratulations!

We have upvoted your post for your contribution within our community.
Thanks again and look forward to seeing your next hunt!

Want to chat? Join us on:

Discord: https://discord.gg/mWXpgks
Telegram: https://t.me/joinchat/AzcqGxCV1FZ8lJHVgHOgGQ

$0.00