Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
43
29
37
Hynek Kydlicek
hynky
Follow
Arshavir's profile picture
Mi6paulino's profile picture
yjernite's profile picture
179 followers
Β·
63 following
HKydlicek
hynky1999
AI & ML interests
Data-processing
Recent Activity
liked
a Space
4 days ago
lerobot/robot-folding
updated
a bucket
12 days ago
macrodata/test_bucket
published
a dataset
24 days ago
macrodata/aloha_motion
View all activity
Organizations
hynky
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
HuggingFaceFW/finepdfs
3 months ago
Question about data ordering/shuffling in the FinePDFs parquet files
1
#30 opened 3 months ago by
yujisw
Row count mismatch for the unknown languages subset
#25 opened 6 months ago by
ayushdg95
How to use this dataset to extract PDFs by subject?
π
1
1
#14 opened 7 months ago by
vgoklani
Can additional corpuses further train this model?
1
#13 opened 7 months ago by
fenjamin
Decontamination against benchmarks?
4
#11 opened 7 months ago by
jo-kn
MarCognity-AI for HuggingFaceFW/finepdfs
1
#23 opened 6 months ago by
elly99
New activity in
HuggingFaceFW/finepdfs
4 months ago
Which language detector did you use
4
#28 opened 4 months ago by
ming030890
The "file_path" data field appears to primarily contain cc-index paths rather than WARC paths.
9
#16 opened 7 months ago by
lnstrument
A Few Questions About the Implementation Details of the finepdfs Project
5
#24 opened 6 months ago by
yoliax
Dataset broken by latest update?
5
#27 opened 5 months ago by
Rijgersberg
What parameters can be configured in Docling?
1
#12 opened 7 months ago by
yoliax
Unable to load dataset
1
#9 opened 7 months ago by
iamgroot42
OCR or not classifier
7
#6 opened 7 months ago by
MaartenMarx
New activity in
HuggingFaceFW/finepdfs-edu
5 months ago
Scoring system
1
#2 opened 5 months ago by
eoggc
New activity in
HuggingFaceFW/finepdfs_lang_classification
6 months ago
datatrove
#2 opened 6 months ago by
hynky
New activity in
HuggingFaceFW/finepdfs
6 months ago
Deciding on extraction path
4
#10 opened 7 months ago by
Mdspike
Were the original PDFs saved?
11
#2 opened 7 months ago by
staghado
Docling output
1
#4 opened 7 months ago by
akreal
adfadvab
#15 opened 7 months ago by
Sora013
Github LInk or XGBoost Model
2
#22 opened 6 months ago by
richard-ac
Load more