Commit History

Update README to reflect the new title and emoji for the Polish Cultural Vision Benchmark (PCVB).
1b358ee
Running

djstrong commited on

Update benchmark_report.json
5894291
verified

Sekon commited on

Update benchmark_report.json
81b4e63
verified

Sekon commited on

Update benchmark_report.json
607a458
verified

Sekon commited on

Comment out CSV file input in app.py to streamline data handling following recent refactor to JSON. This change enhances clarity and aligns with the updated data structure.
e19029d

djstrong commited on

Refactor app.py to use JSON for benchmark data, removing CSV and metadata dependencies. Update performance plotting to reflect new data structure and enhance visualization with cultural context. Introduce benchmark report JSON file for structured model evaluation results.
fd35185

djstrong commited on

Update metadata.json
74253ba
verified

djstrong commited on

Update benchmark_results.csv
7116436
verified

djstrong commited on

Update benchmark_results.csv
482318e
verified

djstrong commited on

Update benchmark_results.csv
1ed9658
verified

djstrong commited on

Update benchmark_results.csv
163f84b
verified

djstrong commited on

Update app.py
07dade5
verified

djstrong commited on

metadata
3e37028

djstrong commited on

new models
b314a79

djstrong commited on

Fix GPU precision label in performance plot
288816d

djstrong commited on

update
f820897

djstrong commited on

update gradio
32bb346

djstrong commited on

update
7a9f32a

djstrong commited on

generate static page leaderboard
235501b

djstrong commited on

Update metadata.json
a3a884e
verified

djstrong commited on

beilik v2.3
73b9490

djstrong commited on

asd
ba2508f

djstrong commited on

update eq-bench
672a5d6

djstrong commited on

Change the way the parsable questions are expressed from numerical to percentage (#1)
53db359
verified

djstrong Draedon commited on

Update README.md
55fd848
verified

djstrong commited on

Update README.md
c930c54
verified

djstrong commited on

Update README.md
1bf5879
verified

djstrong commited on

Update README.md
904f86f
verified

djstrong commited on

eq bench
277ca2e

djstrong commited on

eq bench
c645de7

djstrong commited on

eq bench
d6018f5

djstrong commited on

eq bench
492a075

djstrong commited on

eq bench
23d7973

djstrong commited on

eq bench
f2a3e70

djstrong commited on

eq bench
de1d88f

djstrong commited on

eq bench
bd5b131

djstrong commited on

eq bench
397694c

djstrong commited on

Update README.md
aa5756a
verified

djstrong commited on

initial commit
5741007
verified

djstrong commited on

Duplicate from demo-leaderboard-backend/leaderboard
87ad165
verified

djstrong clefourrier HF Staff commited on