Tags: evaluation + python + large language models

  1. A frozen-in-time version of the Paper Finder agent for reproducing evaluation results. The repository contains the code for the standalone Paper Finder agent, a paper-seeking agent intended to help locate sets of papers according to content-based and metadata criteria.
  2. A GitHub repository directory with resources for evaluating Large Language Models (LLMs), including a Jupyter Notebook demonstrating how to use LLM Arena as a judge and a Python script for the same purpose, plus a README with instructions for viewing the notebook if it does not render correctly on GitHub. (A hedged sketch of the LLM-as-a-judge pattern appears after this list.)
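The LLM-as-a-judge idea referenced in the second bookmark can be summarized with a short sketch. The snippet below is illustrative only and is not taken from the linked repository: it assumes the openai Python package and an OPENAI_API_KEY in the environment, and the model name, prompt wording, and function name judge_pairwise are placeholders.

# Minimal LLM-as-a-judge sketch (illustrative; not code from the linked repo).
# Assumes the `openai` Python package and an OPENAI_API_KEY in the environment;
# the model name and prompt wording are placeholders.
from openai import OpenAI

client = OpenAI()

JUDGE_PROMPT = """You are an impartial judge. Compare the two answers to the
question and reply with exactly "A", "B", or "tie".

Question: {question}

Answer A: {answer_a}

Answer B: {answer_b}
"""

def judge_pairwise(question: str, answer_a: str, answer_b: str,
                   model: str = "gpt-4o-mini") -> str:
    """Ask a judge model which of two candidate answers is better."""
    response = client.chat.completions.create(
        model=model,
        temperature=0,  # deterministic verdicts make repeated runs comparable
        messages=[{"role": "user",
                   "content": JUDGE_PROMPT.format(question=question,
                                                  answer_a=answer_a,
                                                  answer_b=answer_b)}],
    )
    return response.choices[0].message.content.strip()

if __name__ == "__main__":
    print(judge_pairwise("What does HTTP 404 mean?",
                         "The requested resource was not found on the server.",
                         "It means the server crashed."))

In practice, the judge prompt, tie-breaking rules, and answer ordering (to control for position bias) are the parts that evaluation repositories like the one bookmarked tend to refine; the sketch above only shows the basic request/response shape.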
