The Wayback Machine - https://web.archive.org/web/20210727110827/https://github.com/huggingface/datasets/discussions
huggingface/datasets
🙏 How can the SARI metric be used in a Seq2SeqTrainer compute_metric parameter
eytan-c asked Jul 24, 2021 in Q&A · Unanswered · 1 upvote · 0 comments
🙏 Can dataset.map return batch with batched=False?
cheulyop asked Jul 16, 2021 in Q&A · Unanswered · 1 upvote · 0 comments
🙏 Using IterableDataset with Torch DataLoader throws error.
LeenaShekhar asked Jul 1, 2021 in Q&A · Unanswered · 1 upvote · 2 comments
🙏 Sharing datasets cache across different users in linux
halean asked Jun 9, 2021 in Q&A · Unanswered · 1 upvote · 0 comments
💡 Descriptive & aggregate metrics
BenoitDalFerro asked May 21, 2021 in Ideas · Unanswered · 1 upvote · 0 comments
#️⃣ Amharic Support: Thread for those interested
yosiasz asked Feb 19, 2021 in General · Unanswered · 1 upvote · 6 comments
🙏 Is it possible to only download a fraction size of the dataset?
mariusjohan asked Feb 16, 2021 in Q&A · Unanswered · 2 upvotes · 1 comment
🙏 How can I split/reduce the SQUAD dataset?
Nomiluks asked Mar 1, 2021 in Q&A · Unanswered · 2 upvotes · 2 comments
#️⃣ `big_patent` dataset isn't available on version 1.1.3
haridas asked Dec 20, 2020 in General · Unanswered · 1 upvote · 1 comment
#️⃣ MAP for transformers datasets
ziwang-com asked Mar 22, 2021 in General · Unanswered · 1 upvote · 0 comments
🙌 HuggingFace 🤗 Transformers is in Top 10 Python packages with the most unique contributors over the last 12 months on GitHub
8bitmp3 asked Jan 6, 2021 in Show and tell · Unanswered · 2 upvotes · 2 comments