
separately save and load data (collected samples), using torch #9

Open

sheim wants to merge 3 commits into PFMassiani:master from sheim:save_gp

Conversation

@sheim (Contributor) commented Oct 18, 2020

Forgot to commit and open the PR before going to Ticino. I ended up saving with torch instead of numpy, since train_x and train_y are torch tensors anyway. Although... I think torch.save uses pickle in the background.
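For context, saving the tensors directly with torch (rather than via numpy) might look roughly like the following sketch; the helper names, file name, and dict keys are illustrative, not the PR's actual code:

    import torch

    def save_data(train_x, train_y, path="dataset.pt"):
        # torch.save serializes the dict (via pickle) to disk.
        torch.save({"train_x": train_x, "train_y": train_y}, path)

    def load_data(path="dataset.pt"):
        data = torch.load(path)  # unpickles the dict saved above
        return data["train_x"], data["train_y"]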

@PFMassiani (Owner) commented:
Hey,

Yes, saving using torch.save is better.
I find something a bit odd with GP.load, though: the user still needs to specify train_x and train_y even if load_dataset is specified. A way out could be to make Dataset.load a @staticmethod, make train_x and train_y optional arguments in GP.load when load_dataset is specified, and load the GP with the saved data. Do you see a problem with this?
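Roughly, that proposal could look like the sketch below; the @staticmethod on Dataset.load and the optional train_x/train_y in GP.load come from the discussion, while the constructors and attribute names are assumptions:

    import torch

    class Dataset:
        def __init__(self, train_x, train_y):
            self.train_x = train_x
            self.train_y = train_y

        @staticmethod
        def load(path):
            # Assumes the file holds a dict of tensors written by torch.save.
            data = torch.load(path)
            return Dataset(data["train_x"], data["train_y"])

    class GP:
        def __init__(self, train_x, train_y):
            self.dataset = Dataset(train_x, train_y)

        @classmethod
        def load(cls, save_path, train_x=None, train_y=None, load_dataset=None):
            if load_dataset is not None:
                # Take the training data from the saved dataset instead of
                # requiring the caller to pass (dummy) tensors.
                ds = Dataset.load(load_dataset)
                train_x, train_y = ds.train_x, ds.train_y
            model = cls(train_x, train_y)
            # Restoring the model's parameters from save_path is omitted here.
            return model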

@sheim (Contributor, Author) commented Oct 21, 2020

No, not really. I thought about it, and decided that forcing the user to specify a dummy train_x and train_y is actually not such a bad idea, since it makes them learn how to initialize it.
Side note: I do prefer having a non-static method, so you can load the data in place into an existing GP instead of instantiating and returning a new GP to replace the existing one.
No strong opinion either way, though.

@PFMassiani (Owner) commented Oct 21, 2020

Okay, I understand. When specifying a path to load the Dataset, though, I think the user already knows how to initialize it. I'd find it a bit more natural to make train_x and train_y optional, but I don't have strong opinions either.

On the other hand, I disagree about the "staticness" of Dataset.load. Indeed, calling gp.dataset.load does not ensure that the GP object updates its internal state when its data changes (currently, this is done by gp._set_gp_data_to_dataset). Therefore, just calling gp.dataset.load may lead to unexpected behaviour.
Nonetheless, the feature you describe is useful: it could be done in another method that loads the given dataset and then updates the internal state (GP.load_dataset or something)?
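In code, that separate method might look like this sketch (the dataset attribute and the _set_gp_data_to_dataset hook are named as in the discussion; the rest is assumed):

    # Inside the GP class:
    def load_dataset(self, path):
        # Refresh the existing Dataset in place from the saved samples...
        self.dataset.load(path)
        # ...then sync the GP's training data with the refreshed dataset,
        # so the model's internal state matches the new samples.
        self._set_gp_data_to_dataset()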

Review comment on the following lines of the diff:

    model.load_state_dict(save_dict['state_dict'])

    if load_dataset:
        model.dataset.load(load_dataset)
@PFMassiani (Owner) commented:
I think we also need to update the GP's internal state after this: self._set_gp_data_to_dataset()

@sheim (Contributor, Author) replied:

I see. So how about making these two things fully separate? load only loads the model, and, separately, load_dataset loads a dataset. This doesn't make train_x and train_y optional, though...

@sheim (Contributor, Author) commented Oct 22, 2020

So currently, I split up saving/loading the dataset from saving and loading the model, so the original save and load work the same as before.

I tried making train_x and train_y optional by defaulting them to None and checking that the caller provides either a path to a saved dataset or training data... but this breaks because of the @tensorwrap decorator, which doesn't know how to handle None. I didn't want to go messing around with the decorator, though.
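One hedged way around that, assuming the repo's @tensorwrap roughly converts array-like arguments to tensors (its real implementation may differ), is to have the wrapper pass None through untouched so optional arguments survive. A sketch with an illustrative stand-in:

    import functools
    import numbers

    import torch

    def tensorwrap(func):
        # Illustrative stand-in for the repo's decorator: convert
        # array-like arguments to tensors, but let None (and anything
        # that isn't array-like, e.g. self or a path) pass through,
        # so optional tensor arguments can default to None.
        def _maybe_tensor(a):
            if a is None or torch.is_tensor(a):
                return a
            if isinstance(a, (list, tuple, numbers.Number)):
                return torch.as_tensor(a)
            return a

        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            return func(*(_maybe_tensor(a) for a in args),
                        **{k: _maybe_tensor(v) for k, v in kwargs.items()})
        return wrapper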
