Skip to content
This repository was archived by the owner on Nov 1, 2021. It is now read-only.
This repository was archived by the owner on Nov 1, 2021. It is now read-only.

many-examples: remove kaggle dependency #544

@alexcg1

Description

@alexcg1

As discussed in various meetings with @lusloher , @aga11313 , @FionnD

Kaggle is a lot of hoops for a user to jump through just to get an example working: install, set up key, run data getter script.

It's also work for us: We have to ensure datasets haven't moved or changed a lot, and we sometimes have to perform extra steps to process them.

These datasets are generally under creative commons licenses or similar. There's no reason why we can't:

  • Download a subset for example purposes (this keeps things light)
  • Process that subset ourselves (saves users time and effort)
  • Store it either in data/ (for light stuff like text which can go directly in repo) or use get_data.sh to download from somewhere we control (for larger stuff like images)

Affected examples

  • wikipedia-sentences
  • multires-lyrics-search
  • cross-modal-search
  • query-while-indexing

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions