Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nguyenvulebinh
/
AV-HuBERT
like
7
Text Generation
Transformers
Safetensors
speech_to_text
arxiv:
2303.00628
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
AV-HuBERT
2.17 GB
1 contributor
History:
8 commits
nguyenvulebinh
Update README.md
bd55937
verified
10 months ago
.gitattributes
1.59 kB
add lips model for inference
over 1 year ago
20words_mean_face.npy
1.17 kB
xet
add lips model for inference
over 1 year ago
README.md
5.77 kB
Update README.md
10 months ago
config.json
3.78 kB
Update config.json
over 1 year ago
finetuning_TalkSet.model
4.18 MB
xet
Upload 3 files
about 1 year ago
generation_config.json
189 Bytes
Upload AV2TextForConditionalGeneration
over 1 year ago
model.safetensors
1.92 GB
xet
Upload AV2TextForConditionalGeneration
over 1 year ago
sentencepiece.bpe.model
253 kB
xet
Upload tokenizer
over 1 year ago
sfd_face.pth
89.8 MB
xet
Upload 3 files
about 1 year ago
shape_predictor_68_face_landmarks.dat
99.7 MB
xet
add lips model for inference
over 1 year ago
special_tokens_map.json
552 Bytes
Upload tokenizer
over 1 year ago
syncnet_v2.model
54.6 MB
xet
Upload 3 files
about 1 year ago
tokenizer_config.json
1.09 kB
Upload tokenizer
over 1 year ago
vocab.json
20.1 kB
Upload tokenizer
over 1 year ago