xprof user guide #2300

suexu1025 · 2025-09-05T21:53:18Z

Description

Start with a short description of what the PR does and how this is a change from
the past.

The rest of the description includes relevant details and context, examples:

why is this change being made,
the problem being solved and any relevant context,
why this is a good solution,
some information about the specific implementation,
shortcomings of the solution and possible future improvements.

If the change fixes a bug or a Github issue, please include a link, e.g.,:
FIXES: b/123456
FIXES: #123456

Notice 1: Once all tests pass, the "pull ready" label will automatically be assigned.
This label is used for administrative purposes. Please do not add it manually.

Notice 2: For external contributions, our settings currently require an approval from a MaxText maintainer to trigger CI tests.

Tests

Please describe how you tested this change, and include any instructions and/or
commands to reproduce.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed.

richjames0

LGTM

bvandermoon

Can we check with the profiling team on how this doc should be framed? There is already detailed documentation that exists for the profiler tools. Should we just link to that instead?

RissyRan

Thanks Qinwen!

RissyRan · 2025-09-08T17:52:34Z

docs/guides/xprof_user_guide.md

+
+
+
+*   **Sampling Mode:** This mode allows for continuous profiling by sampling data during model execution.


Out of curious, how to do Sampling?

RissyRan · 2025-09-08T17:58:35Z

docs/guides/xprof_user_guide.md

+
+## Introduction to Xprof
+
+Xprof is a powerful tool designed for profiling and analyzing the training performance of AI models. For Maxtext developers, understanding and utilizing Xprof can significantly help in optimizing model performance, identifying bottlenecks, and improving training efficiency.


I think Xprof is not open sourced? We probably should recommend to use tensorboard or other OSS tools instead, like cloud version?

RissyRan · 2025-09-08T18:01:13Z

docs/guides/xprof_user_guide.md

+
+
+
+*   Trace Viewer


It will be great if we have some screenshot to show customers especially for someone is not familiar with tool, like Trace View. But not mandatory (if you think this is clear) :)

Similar comments for other section.

gobbleturk · 2025-09-08T18:14:22Z

docs/guides/xprof_user_guide.md

None of this is maxtext specific, is there any xprof documentation we can point to instead?

I would find some xprof folks to review this as well and comment if we can't find any, rjesha@ probably knows who to reach out to

xprof user guide

51d9b0d

suexu1025 requested review from gobbleturk, khatwanimohit, bvandermoon, vipannalla, RissyRan, richjames0, gagika, shralex, yangyuwei, SurbhiJainUSC, hengtaoguo, A9isha, aireenmei and NuojCheng as code owners September 5, 2025 21:53

clean up image links

062ff44

richjames0 approved these changes Sep 8, 2025

View reviewed changes

bvandermoon reviewed Sep 8, 2025

View reviewed changes

RissyRan reviewed Sep 8, 2025

View reviewed changes

gobbleturk reviewed Sep 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

xprof user guide #2300

xprof user guide #2300

Uh oh!

suexu1025 commented Sep 5, 2025

Uh oh!

richjames0 left a comment

Uh oh!

bvandermoon left a comment

Uh oh!

RissyRan left a comment

Uh oh!

RissyRan Sep 8, 2025

Uh oh!

RissyRan Sep 8, 2025

Uh oh!

RissyRan Sep 8, 2025

Uh oh!

gobbleturk Sep 8, 2025

Uh oh!

gobbleturk Sep 8, 2025

Uh oh!

Uh oh!




		* Sampling Mode: This mode allows for continuous profiling by sampling data during model execution.


		## Introduction to Xprof

		Xprof is a powerful tool designed for profiling and analyzing the training performance of AI models. For Maxtext developers, understanding and utilizing Xprof can significantly help in optimizing model performance, identifying bottlenecks, and improving training efficiency.

xprof user guide #2300

Are you sure you want to change the base?

xprof user guide #2300

Uh oh!

Conversation

suexu1025 commented Sep 5, 2025

Description

Tests

Checklist

Uh oh!

richjames0 left a comment

Choose a reason for hiding this comment

Uh oh!

bvandermoon left a comment

Choose a reason for hiding this comment

Uh oh!

RissyRan left a comment

Choose a reason for hiding this comment

Uh oh!

RissyRan Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

RissyRan Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

RissyRan Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

gobbleturk Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

gobbleturk Sep 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!