Activity

Add final annotations

steffenkleinle pushed 1 commit to main • 6377c1b…b6c1f48 • on Jan 13

Add missing column

steffenkleinle pushed 1 commit to main • ef4e336…6377c1b • on Sep 27, 2024

Improve readme

steffenkleinle pushed 1 commit to main • f0f72be…ef4e336 • on Sep 27, 2024

Deleted branch

steffenkleinle deleted llm-evaluation • on Sep 27, 2024

Merge pull request #7 from digitalfabrik/llm-evaluation

Pull request merge
steffenkleinle pushed 48 commits to main • 57cbbbd…f0f72be • on Sep 27, 2024

Fix readme

steffenkleinle pushed 1 commit to llm-evaluation • 7ec21c8…6166efd • on Sep 27, 2024

Improve readme and rename frontend and backend

steffenkleinle pushed 1 commit to llm-evaluation • f7a5930…7ec21c8 • on Sep 27, 2024

Add readme and tidy up

steffenkleinle pushed 1 commit to llm-evaluation • 6c0c9ef…f7a5930 • on Sep 26, 2024

More cross-language numbers

steffenkleinle pushed 1 commit to llm-evaluation • 86be6d5…6c0c9ef • on Jul 8, 2024

LLM unanswerability numbers

steffenkleinle pushed 1 commit to llm-evaluation • 4669679…86be6d5 • on Jul 1, 2024

Unanswerable LLM

steffenkleinle pushed 1 commit to llm-evaluation • a9f8f10…4669679 • on Jul 1, 2024

Add more evaluation methods and new unanswerability prompt

steffenkleinle pushed 1 commit to llm-evaluation • 1ce31b5…a9f8f10 • on Jun 28, 2024

Cross lingual

steffenkleinle pushed 1 commit to llm-evaluation • d94f840…1ce31b5 • on Jun 28, 2024

Mixtral multilingual

steffenkleinle pushed 1 commit to llm-evaluation • 8199834…d94f840 • on Jun 27, 2024

llama8b multilingual

steffenkleinle pushed 1 commit to llm-evaluation • 3bb1595…8199834 • on Jun 27, 2024

Adjust numbers and include deberta

steffenkleinle pushed 1 commit to llm-evaluation • d453655…3bb1595 • on Jun 24, 2024

Add dev predictions for different context lengths

steffenkleinle pushed 1 commit to llm-evaluation • 8f0fe7a…d453655 • on Jun 24, 2024

Fix classifier

steffenkleinle pushed 1 commit to llm-evaluation • 4ccedf0…8f0fe7a • on Jun 21, 2024

Add numbers for context length 2

steffenkleinle pushed 2 commits to llm-evaluation • 51aaa90…4ccedf0 • on Jun 21, 2024

Improved postprocessing, postprocessing table and additional utilities

steffenkleinle pushed 1 commit to llm-evaluation • 71afa1b…51aaa90 • on Jun 21, 2024

More and more numbers

steffenkleinle pushed 1 commit to llm-evaluation • 20b3719…71afa1b • on Jun 18, 2024

Update numbers

steffenkleinle pushed 1 commit to llm-evaluation • 8eb97a8…20b3719 • on Jun 17, 2024

More and more numbers

steffenkleinle pushed 1 commit to llm-evaluation • fd53d33…8eb97a8 • on Jun 17, 2024

Add more numbers for test partition

steffenkleinle pushed 1 commit to llm-evaluation • c132f6a…fd53d33 • on Jun 17, 2024

More numbers

steffenkleinle pushed 1 commit to llm-evaluation • ff1c017…c132f6a • on Jun 15, 2024

Add more finetuning

steffenkleinle pushed 1 commit to llm-evaluation • 87fca00…ff1c017 • on Jun 15, 2024

Finetuning

steffenkleinle pushed 1 commit to llm-evaluation • 156f474…87fca00 • on Jun 15, 2024

Improve classifier

steffenkleinle pushed 1 commit to llm-evaluation • eafe020…156f474 • on Jun 14, 2024

Add more predictions

steffenkleinle pushed 1 commit to llm-evaluation • a9ddf73…eafe020 • on Jun 13, 2024

Add numbers for test partition

steffenkleinle pushed 1 commit to llm-evaluation • 6223c61…a9ddf73 • on Jun 12, 2024