Skip to content

Commit d969fa1

Browse files
authored
Merge branch 'main' into migrate-to-new-theme
2 parents b8fd3d7 + 9a44439 commit d969fa1

File tree

5 files changed

+301
-273
lines changed

5 files changed

+301
-273
lines changed

.jenkins/validate_tutorials_built.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@
2323
"beginner_source/examples_autograd/polynomial_autograd",
2424
"beginner_source/examples_autograd/polynomial_custom_function",
2525
"intermediate_source/mnist_train_nas", # used by ax_multiobjective_nas_tutorial.py
26-
"intermediate_source/fx_conv_bn_fuser",
26+
"intermediate_source/torch_compile_conv_bn_fuser",
2727
"intermediate_source/_torch_export_nightly_tutorial", # does not work on release
2828
"advanced_source/usb_semisup_learn", # fails with CUDA OOM error, should try on a different worker
2929
"prototype_source/fx_graph_mode_ptq_dynamic",

index.rst

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -349,13 +349,6 @@ Welcome to PyTorch Tutorials
349349

350350
.. Code Transformations with FX
351351
352-
.. customcarditem::
353-
:header: Building a Convolution/Batch Norm fuser in FX
354-
:card_description: Build a simple FX pass that fuses batch norm into convolution to improve performance during inference.
355-
:image: _static/img/thumbnails/cropped/Deploying-PyTorch-in-Python-via-a-REST-API-with-Flask.png
356-
:link: intermediate/fx_conv_bn_fuser.html
357-
:tags: FX
358-
359352
.. customcarditem::
360353
:header: Building a Simple Performance Profiler with FX
361354
:card_description: Build a simple FX interpreter to record the runtime of op, module, and function calls and report statistics
@@ -584,6 +577,13 @@ Welcome to PyTorch Tutorials
584577
:link: intermediate/torch_compile_tutorial.html
585578
:tags: Model-Optimization
586579

580+
.. customcarditem::
581+
:header: Building a Convolution/Batch Norm fuser in torch.compile
582+
:card_description: Build a simple pattern matcher pass that fuses batch norm into convolution to improve performance during inference.
583+
:image: _static/img/thumbnails/cropped/generic-pytorch-logo.png
584+
:link: intermediate/torch_compile_conv_bn_fuser.html
585+
:tags: Model-Optimization
586+
587587
.. customcarditem::
588588
:header: Inductor CPU Backend Debugging and Profiling
589589
:card_description: Learn the usage, debugging and performance profiling for ``torch.compile`` with Inductor CPU backend.
@@ -874,6 +874,7 @@ Additional Resources
874874
:maxdepth: 1
875875
:hidden:
876876

877+
877878
deep-dive
878879

879880
.. toctree::

intermediate_source/fx_conv_bn_fuser.py

Lines changed: 0 additions & 262 deletions
This file was deleted.

intermediate_source/fx_profiling_tutorial.py

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -216,9 +216,6 @@ def summary(self, should_sort : bool = False) -> str:
216216
#
217217
# * ``MaxPool2d`` takes up the most time. This is a known issue:
218218
# https://github.com/pytorch/pytorch/issues/51393
219-
# * BatchNorm2d also takes up significant time. We can continue this
220-
# line of thinking and optimize this in the Conv-BN Fusion with FX
221-
# `tutorial <https://pytorch.org/tutorials/intermediate/fx_conv_bn_fuser.html>`_.
222219
#
223220
#
224221
# Conclusion

0 commit comments

Comments
 (0)