| --- |
| language: |
| - en |
| tags: |
| - vision-language |
| - clip |
| - vilt |
| datasets: |
| - lil-lab/kilogram-data |
|
|
| --- |
| |
| KiloGram dataset and code repo: https://github.com/lil-lab/kilogram |
|
|
| Preprocessed training and evaluation data: https://huggingface.co/datasets/lil-lab/kilogram-data |
|
|
| # Citation |
|
|
| ```bibtex |
| @misc{ji2022abstractvisualreasoningtangram, |
| title={Abstract Visual Reasoning with Tangram Shapes}, |
| author={Anya Ji and Noriyuki Kojima and Noah Rush and Alane Suhr and Wai Keen Vong and Robert D. Hawkins and Yoav Artzi}, |
| year={2022}, |
| eprint={2211.16492}, |
| archivePrefix={arXiv}, |
| primaryClass={cs.CL}, |
| url={https://arxiv.org/abs/2211.16492}, |
| } |
| ``` |