-
Notifications
You must be signed in to change notification settings - Fork 19
[Experimental] Add sample datapackage.jsonld #101
base: master
Are you sure you want to change the base?
Conversation
|
Since |
| rm -r archive.org | ||
| ", | ||
| "size": "1 GB", | ||
| "resources": { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"resources" is the nomenclature used in datapackage.json.
| @@ -0,0 +1,38 @@ | |||
| { | |||
| "name": "scholarpedia.org", | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
REQUIRED in all of the references.
|
The most effective way to find the common ground, I think, is by packaging datasets that have already been packaged in these various standards and see which choice covers the most cases. |
|
@flyingzumwalt since you're the captain of this repo, RFCR? Is this not in the intended direction? |
|
I put 2 version of manifest file. One is the output of |
|
All of the issues you're closing pre-date my involvement. I will have to do a bit of reading in order to provide comments. That will have to wait until after the data.gov sprint. I'm currently maxed out dealing with that work. |
|
This PR packages the data with the scale of 10e5 orders of magnitude smaller than data.gov. Delivering data.gov depends on having the manifest/datapackage.json/packfile implemented, which can be done in parallel with smaller datasets. |
|
The issues this PR close basically contain which datasets have been published to ipfs. The main concern is the datapackage.json format and the packmanifest format. I don't know how to make this simpler to put: I decouple the task of having to spec these format from having to prepare a 300TB data in the first place. |
|
Just as a comment, here is the {
"last-synch": "2016-12-27T16:04:20.322511",
"name": "ipfs-ietf-rfc-archive",
"license": {
"url": "http://trustee.ietf.org/license-info/IETF-TLP-4.pdf",
"type": "other-open"
},
"title": "IETF RFC Archive",
"sources": [
{
"web": "https://www.rfc-editor.org/retrieve/",
"name": "RFC Editor"
}
],
"resources": [
{
"path": "rfc-index.txt",
"name": "rfc-index"
},
{
"path": "rfc-data/",
"name": "rfc-data"
},
{
"path": "update.py",
"name": "update-script"
}
],
"ipfs-github-issue": "https://github.com/ipfs/archives/issues/18"
}I based my format on http://specs.frictionlessdata.io/data-package/. What are your thoughts about this format, or the differences between this and your proposal? |
resourcesis normalizeddatapackage.jsonCloses: #86, #18, #32, #35