Hey all, I’ve been working on making it possible to instantiate PyTorch modules with uninitialized parameters / buffers (see #29523). This functionality helps avoid unnecessary computation when doing non-standard parameter initialization, when loading from a serialized `state_dict`, etc.
This is currently supported via the following two-step process:
```python
import torch

# Initialize module on the meta device.
m = torch.nn.Linear(5, 1, device='meta')

# Move module to CPU with empty / uninitialized parameters.
m.to_empty(device='cpu')
```
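As a concrete illustration of the `state_dict` use case mentioned above, the default initialization can be skipped entirely when the parameters are about to be overwritten anyway (a minimal sketch; `linear.pt` is a hypothetical checkpoint path used only for illustration):

```python
import torch

# Construct without running the default initialization.
m = torch.nn.Linear(5, 1, device='meta').to_empty(device='cpu')

# The empty parameters are fully overwritten by the serialized
# state_dict, so the skipped initialization was never needed.
m.load_state_dict(torch.load('linear.pt'))
```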
This is a bit arcane for the simple “skip init” use case, so I opened a PR adding a helper function that does the above in a more sugary way:
```python
import torch

# Instantiate a module with uninitialized parameters.
m = torch.nn.utils.skip_init(torch.nn.Linear, 5, 1)
```
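For those curious about the mechanics, a helper like this can be written as a thin wrapper around the two-step process above. This is just a sketch under the assumption that the wrapped module class accepts a `device` keyword argument; it is not necessarily what the PR implements:

```python
import torch

def skip_init_sketch(module_cls, *args, **kwargs):
    # Hypothetical helper for illustration; assumes module_cls accepts
    # a `device` kwarg. Build on the meta device so no parameter memory
    # is allocated and no init code runs, then materialize empty
    # storage on the target device.
    final_device = kwargs.pop('device', 'cpu')
    kwargs['device'] = 'meta'
    return module_cls(*args, **kwargs).to_empty(device=final_device)
```

The typical follow-up is a custom initialization, e.g. `torch.nn.init.orthogonal_(m.weight)`, so none of the default init work is wasted.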
I’m hoping for comments / suggestions / name bikeshedding on the proposed sugary version. Please add your opinions if you have them. Thanks!