New Contributor Looking for Mentorship!

Hi everyone,

I’m Emmett, an eighteen-year-old who’s passionate about contributing to PyTorch! I’ve been teaching myself machine learning for the past three years, and I took a gap year before entering college to do independent research (this is my most recent ongoing project). I want to build my skills as a machine learning developer, and I’ve been advised that contributing to open-source projects is one of the best ways to do so. I use PyTorch very frequently, so it seemed like an obvious project to contribute to.

I would like to spend the upcoming months contributing to PyTorch full-time to build my skills and would really appreciate guidance/mentorship :).

So far I’ve made these two PRs:

And I’m interested in pretty much all tasks, especially more involved ones that require me to understand PyTorch in more depth.

I’m especially interested in working on the following issues:

Hi! Nice to see this after engaging with you in recent discussions like Optimizers' `differentiable` flag doesn't work · Issue #141832 · pytorch/pytorch · GitHub! Since you’re interested in contributing more than just piecemeal, I’d be excited to work with you on tackling a more holistic, medium-sized project around differentiable optimizers if you’re interested.

You’ve already gotten some context around action items, so this is to set a clearer goal and frame the bigger picture. Ultimately, we want to see differentiable optimizers have better support, test coverage, and documentation than they have today.

What would an ideal end state look like?
(1) Better support: people can run differentiable optimizers with lr, betas, and weight_decay as Tensors that require grad, meaning people can train their optimizer hyperparameters (see the rough sketch after this list).

(2) Better documentation: we have a tutorial in the pytorch/tutorials repo (GitHub - pytorch/tutorials: PyTorch tutorials) showing a real use case for differentiable optimizers, and our pytorch/pytorch documentation has a simpler code example. We also raise proper errors/warnings within the code linking to these resources.

(3) Fuller test coverage: our differentiable tests were excluded from our general test infrastructure migration to OptimizerInfo, but ideally we’d use OptimizerInfos for these tests as well. An example of what our differentiable tests should look like can be found in pytorch/test/test_optim.py at main · pytorch/pytorch · GitHub; see test_foreach_large_tensor. We’d want to use the OptimizerInfo infra to encompass all the new tests we want to add, like lr as a Tensor, etc.
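
For (1), here’s a rough sketch of what that end state could look like. This is purely illustrative: the shapes, the stand-in gradient, and whether each line runs cleanly today are assumptions rather than current behavior (making this path actually work is the point of the project).

```python
import torch
from torch.optim import SGD

# A learnable learning rate: the hyperparameter is itself a Tensor that requires grad.
w = torch.randn(3, requires_grad=True).clone()  # clone -> non-leaf, so the in-place update stays in the graph
lr = torch.tensor(0.1, requires_grad=True)

w.grad = torch.ones_like(w)  # stand-in for an inner-loss gradient
opt = SGD([w], lr=lr, differentiable=True)
opt.step()  # with differentiable=True, the update itself is tracked by autograd

# An outer ("meta") loss evaluated after the update; in the target end state,
# gradients flow back through the optimizer step into lr.
meta_loss = (w ** 2).sum()
meta_loss.backward()
print(lr.grad)  # populated once tensor-lr-requires-grad support lands
```

Once this works for SGD, the same shape of example is what we’d generalize to betas/weight_decay and to the other optimizers.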

Like most destinations, this end state can be reached from several directions, and here’s a sample path taken from what we already delineated in the linked issue above:
Step 0 (could be done in parallel or first, based on preference): Migrate the current differentiable tests pytorch/test/optim/test_optim.py at main · pytorch/pytorch · GitHub to use OptimizerInfos + expand test coverage.
Step 1: Support a tensor LR when differentiable is True for SGD. Add a test case and docs in the code.
Step 2: Now, what if the tensor LR requires grad? Make sure this works, and add a test case and docs in the code (see the gradcheck-style sketch after this list).
Step 3: Expand the above to different optimizers, Adam, AdamW, Adagrad, etc. Of course, add test cases and corresponding docs. This might be when it’d be good to consider using OptimizerInfos if you haven’t yet.
Step 4: Add error messaging.
Step 5: Add overarching docs on how to use differentiable optimizers and what’s supported. I could also see this being step 1, with gradual improvements as steps 1-3 are completed.
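
To make step 2 concrete, here’s a rough sketch in the spirit of the existing TestDifferentiableOptimizer tests, which check gradients with gradcheck. The sgd_step helper and the shapes here are just illustrative, and I’d expect this to fail (or error) until steps 1 and 2 land.

```python
import torch
from torch.optim import SGD

def sgd_step(param, grad, lr):
    # Clone so the optimizer's in-place update happens on a non-leaf tensor,
    # mirroring the setup in the existing differentiable optimizer tests.
    p = param.clone()
    p.grad = grad
    SGD([p], lr=lr, differentiable=True).step()
    return p

param = torch.rand(4, dtype=torch.float64, requires_grad=True)
grad = torch.rand(4, dtype=torch.float64, requires_grad=True)
lr = torch.rand((), dtype=torch.float64, requires_grad=True)  # scalar tensor LR

# gradcheck numerically verifies gradients w.r.t. every input that requires grad,
# including lr.
torch.autograd.gradcheck(sgd_step, (param, grad, lr))
```

The eventual version of this would be parametrized over optimizers via OptimizerInfo rather than hard-coding SGD.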

Let me know what you think!

This sounds wonderful and I would absolutely love to work with you on this project! Working on broader differentiable optimizer support seems really meaningful and exciting. I would love to start with step 0 before moving on to step 1, rather than working on both in parallel, because I haven’t used OptimizerInfo and I think I’ll have a bit to learn.

Thank you so much for this opportunity! I’ll start tomorrow after I tie up loose ends with other PyTorch PRs. Do you have a preferred method of communication for us to use?

Cool! I’ve sent you a message (I think? I haven’t sent messages on dev discuss before lol) for further communication.

By the way, when I said “in parallel”, I just meant it could be done at any time, not necessarily at the same time. For example, feel free to do step 1 independently and then look at step 0. I find that it’s easiest to start with something you’re already a bit familiar with, and there’s quite a lot of flexibility here in where you want to start.

Ok that makes sense! And I have received your message.