Perhaps it's a very basic question, but I haven't been able to reconcile my observations with my understanding.
Suppose I tie the weights of an entire model to a flat buffer by doing something similar to this (imagine `params` is supplied by `model.parameters()`).
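For concreteness, here is a minimal sketch of the kind of thing I mean (the shapes and the `params` list are hypothetical stand-ins for `model.parameters()`):

```python
import torch

# Hypothetical stand-in for model.parameters(): a weight and a bias.
params = [
    torch.randn(3, 4, requires_grad=True),
    torch.randn(3, requires_grad=True),
]

# Concatenate every parameter into one flat buffer.
# Note: torch.cat allocates new storage, so flat_buffer is a copy,
# not a view of the original parameter tensors.
flat_buffer = torch.cat([p.detach().reshape(-1) for p in params])

print(flat_buffer.numel())  # total number of parameter elements
```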
Will the weights hold a reference to the flat buffer, or does the flat buffer stay separate in memory?
What I observe is that the contents of `flat_buffer` stay the same during the training loop while the model weights change as expected, so clearly the weights did not end up referencing the buffer.
Any suggestions on how to achieve that effect, i.e. map all the weights of the model onto a flat buffer such that the weights keep pointing into the buffer?
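To make the question concrete, this is a sketch of the sharing behaviour I'm after (toy `nn.Linear` model for illustration; I'm assuming in-place optimizer updates would then mutate the buffer directly):

```python
import torch
import torch.nn as nn

# Hypothetical toy model; any nn.Module should work the same way.
model = nn.Linear(4, 3)

# Allocate one flat buffer large enough for all parameters.
numel = sum(p.numel() for p in model.parameters())
flat_buffer = torch.zeros(numel)

# Copy the existing weights into the buffer, then re-point each
# parameter at a slice (view) of the buffer. Afterwards the
# parameters and the buffer share the same underlying storage.
offset = 0
for p in model.parameters():
    n = p.numel()
    flat_buffer[offset:offset + n].copy_(p.detach().reshape(-1))
    p.data = flat_buffer[offset:offset + n].view_as(p)
    offset += n

# Sanity check: mutating the buffer is now visible through the model.
flat_buffer.fill_(1.0)
assert all((p == 1.0).all() for p in model.parameters())
```

The sanity check at the end is exactly what fails with my current approach: writes to the buffer are not visible through the parameters (and vice versa).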
Perhaps something to that effect has already been done in the `gradient_view_as` feature?