[p5.strands] Significant refactor for p5.strands #8009

lukeplowden · 2025-07-30T12:05:27Z

Addresses #7868

Changes:

This (draft) PR is for a significant refactor for p5.strands I've been working on for the past month. Thank you to the contributors in other issues who have been patient in waiting for this update as it has blocked some progress in other areas. And thanks to all in Discord showing an interest too. I would love to get the thoughts of those who have been interested in contributing to p5.strands thus far (or any newcomers). This refactor is all about developer ergonomics for p5.strands:
@LalitNarayanYadav @perminder-17 @reshma045 @pratham-radadiya @ShaunakMishra25 @Orsenna187

At current, the refactor is just missing swizzles, a slight change needs to be made to the transpiler to make Unary operations work. Then, I need to do a once over and remove any extra types etc. which are left over from earlier stages in this refactor.

Overview of the refactor

The main purpose of this refactor is to make it more extendable to WGSL in the future, to modularise for developer ergonomics generally, and to make tests and FES easier to re-implement. It separates concerns throughout the p5.strands architecture and adds a much clearer type system. By modularising the codebase, a more straightforward roadmap and contributor documentation can be written up for p5.strands. More on that at the end of this PR, and I will leave some stubs for new issues related to this.

Entry point

p5.strands is still accessible through the same p5.Shader.modify method. The function override for this now exists in p5.strands.js, however. This file also initialises a strandsContext object, and also initialises the user API with this context. In the future, this file could potentially override createShader().

User API

The user API in strands_api.js includes all of the hooks, i.e. methods available p5.Shader.modify() such as getWorldInputs() , getFinalColor() and so on.

It also includes StrandsNode, a simplified class as compared to the previous implementation. Previously, the user had handles to classes derived from BaseNode. There were between 10-15 of these, each with slightly different methods and data, to handle all edge cases for both operations and also for code generation. This was confusing for developer experience, but also created the problem that it was hard to know where to document strands features, and what to document.

The StrandsNode class only contains user facing methods, like .add(), .mult(), and members for swizzling such as .xyz, .rrg etc. Apart from that, it has a this.id which corresponds to an ID in the compilers Intermediate Representation. More on this later, but overall the user API is less tied to backend specifics now.

This file also contains a few more functions like type constructors (vec3, float, also now with ivec3, bool etc.), strandsIf() and discard() which are in progress, and (now I'm reminded I need to add this:) instanceID() as before.

Finally, it also pulls in functions from strands_builtins.js. These are similar as in the previous implementation, except now with a more robust type system which is explained below. @LalitNarayanYadav, you might be interested in reviewing this and potentially re-porting lerp here (sorry!) and copying noise across too, which shouldn't need to change!

Stages of the compiler

The p5.strands compiler is broken more clearly into separate stages. These are similar, but a bit different, to the classic three stages of a compiler. Previously, These stages were shared between the BaseNode class and its children, the ShaderGenerator class, and the p5.Shader.modify() method. The resulting codebase was becoming difficult to extend, and also difficult to summarise.

1. Front-end: Transpile Stage

Overview: Transpiles from the p5.strands 'language' to the JavaScript API.
Files: strands_transpiler.js
External Dependencies: ESCodegen and Acorn

adds operator overloading to allow normal JS operators ([], +, -, == etc) to work on Strands Nodes.
It works by using Acorn to generate an AST, traversing the AST and replacing nodes. Then, it uses ESCodegen to turn this back into code.

2. Middle-end: Building the Intermediate Representation (IR)

Overview: Builds graphs which represent the user's code
Files: ir_dag.js, ir_cfg.js, ir_types.js, ir_builders.js

ir_builders.js is one step beyond the User API file. The functions in here do most of the heavy lifting in building up the IR graphs. All of the functions in the User API call to here.

When the user calls methods like .add() or vec3(), they are returned a user facing StrandsNode as mentioned above. However, this also builds a node in the IR's directed acyclic graph (DAG), which model data dependencies, and records its existence in the control flow graph (CFG), which models data flow. These graphs are implemented in the ir_dag.js and ir_cfg.js files respectively.

The users nodes are handles to nodes in the DAG. So this includes variables and operations (that's it for the most part). There are no 'no-ops' at current. Inside of strandsIf(), a new 'basic block' is made in the CFG. The strandsContext (via the builder functions) keeps track of the current block, and any user instructions (like a function call or addition) are recorded in the current block.

The ir_types.js file has a number of pseudo enums and look up tables for different types. These include BlockType or basic blocks, NodeType for variables vs operations (maybe name is too ambiguous now but use if obvious), etc.

The most obvious (and complex) of these are DataType's which model types such as float, int and their vector variations. As a shader DSL based in JS, I've arrived at objects with a shape: { baseType: 'float', dimension: '1', priority: '3' } etc. Therefore you can compose a final shader type by doing node.baseType + node.dimension, which just separates our types from GLSL a bit for down the road.

Once the user's code has finished running and all of the graphs are built, we do a topological sort on the CFG. We are able to topo sort because, although there are kind of back-edges in the graph, we don't really need to model goto's purely, we just need to output code gen if() in the codegen. This is still a work in progress, however.

3. Back-end: Code generation

Overview: Generates GLSL code from the intermediate representation
Files: strands_codegen.js, strands_glslBackend.js

This does as it says: generates GLSL code from the IR. We currently do the CFG sort in this section, and create generationContext object to store our lines of generated code, and temporary variable names. Next, we loop over the basic blocks and output the code for each visited node.

We only have to use some of the same types from the IR, but most of the heavy lifting is already done (as mentioned) and the code output is relatively simpler code. It is similarly structured to Acorn's visitor functions: we define an object with different visitor functions for different node types.

Importantly, the WGSL implementation should be a similarly simple process to add, and could be done by a direct port of the ``strands_glslBackend.js` file.

FES file

I have also disabled and reenabled FES in the strandsContext object as before, however I have also added a temporary strands_FES.js file here. There are several places in which I have added user errors, but I'm not sure on the best approach for this and have to look more deeply at the rest of FES before overriding it.

Next steps / input

Right now, there are few classes (only user facing ones, in order to have chainable methods easily). I was reading about data oriented design whilst making this (not saying its perfect) but ended up having few classes because of this. It also means that the graphs are structs of arrays, and nodes are just indices into them. If people feel strongly I can refactor these into classes. For example, strandsContext could become class StrandsRuntime or similar:

function initStrandsContext(ctx, backend) {
    ctx.dag = createDirectedAcyclicGraph();
    ctx.cfg = createControlFlowGraph();
    ctx.uniforms = [];
    ctx.hooks = [];
    ctx.backend = backend;
    ctx.active = true;
    ctx.previousFES = p5.disableFriendlyErrors;
    p5.disableFriendlyErrors = true;
}

I'm not sure how I feel about the current (broken) approach to swizzling. Maybe it was better to have Proxy objects as in the previous implementation. I don't like attaching hundreds of members of xyzw permutations to the StrandsNodes prototype, what do you think @davepagurek ?
There are fair number of new files and a new strands folder added to the repo in this PR. How do you feel about that and also naming conventions @ksen0?
In the coming weeks, this writeup could be adapted into a proper contributor docs outlining all of this in a succinct (and visual way).
It could be neat to represent the IR graphs in a p5.sketch (a shader) and use this as a visual test. Visualising the language as much as possible will help contributors to understand how its working.
Once I have figured out the if statements properly and finally, loops would follow a similar structure and a good issue for somebody to tackle if they want to
There is a possibility to optimize the shader code after the IR is built, for example there's a template for constant folding already in ir_types. I'm just not sure whether this will actually optimize anything, or whether the respective backend compiler (GLSL/ WGPU) will do a better job anyway.
As mentioned, would be good to get somebody from FES @IIITM-Jay looking at this at some point, although no rush for now.
As the type system mas matured, we are more capable of defining the input structs to the hooks in our IR. I can see a possibility that more and more of our internal shader workings could run on p5.strands

Will write anything down here as I think of more

PR Checklist

npm run lint passes
Inline reference is included / updated
Unit tests are included / updated

…gnments)

…not p5 defined structs such as Vertex inputs)

…strands-refactor

…ready a node.

…k on swizzles

…not p5 defined structs such as Vertex inputs)

…ready a node.

…k on swizzles

…strands-refactor

davepagurek

Thanks for all this work, it's looking good!

Just for my own understanding (and to maybe put in a doc somewhere at some point), the distinction between the control flow graph and the DAG is that the DAG stores a node for each state of each variable as it goes through the program, and the CFG controls the higher level constructs like functions, loops, and if statements that break up the values? I'm sort of picturing the CFG and DAG nodes as all inhabiting the same overall graph, sort of like these rough diagrams from when we were talking earlier:

But in the above picture it's not clear what goes in each block, so like these would both be equivalent:

let a = 1 let b = 2 if (b > 1) { let c = 3 a = c } return a	let a = 1 let b = 2 let condition = b > 1 let c = 3 if (condition) { a = c } return a

So are the control flow graph nodes sort of like the big IfElse block in there but that also draw a line around which values should be within the different parts of the if?

davepagurek · 2025-07-30T20:25:42Z

src/strands/ir_builders.js

+  let { dimension, baseType } = typeInfo;
+
+  if (dimension !== 1) {
+    FES.internalError('Created a literal node with dimension > 1.')


Suggested change

FES.internalError('Created a literal node with dimension > 1.')

FES.internalError('Created a scalar literal node with dimension > 1.')

davepagurek · 2025-07-30T21:52:19Z

src/strands/p5.strands.js

+    p5.disableFriendlyErrors = true;
+  }
+
+  function deinitStrandsContext(ctx) {


Do we need to set ctx.active = false in here? Looks like some of the test failures may be due to the context remaining active

davepagurek · 2025-07-30T21:55:14Z

src/strands/strands_transpiler.js

+    // The callbacks for AssignmentExpression and BinaryExpression handle
+    // operator overloading including +=, *= assignment expressions
+    ArrayExpression(node, _state, _ancestors) {
+      const original = JSON.parse(JSON.stringify(node));


I think we'll need to re-apply the early returns added in #7961, where we check for an ancestor being a uniform

davepagurek · 2025-07-30T22:00:59Z

src/strands/strands_transpiler.js

+}
+
+function ancestorIsUniform(ancestor) {
+  return ancestor.type === 'CallExpression'


There's also an updated version of this in #7961 that handles instance mode

davepagurek · 2025-07-30T22:04:30Z

src/webgl/ShaderGenerator.js

@@ -1116,13 +1116,12 @@ function shadergenerator(p5, fn) {
      GLOBAL_SHADER = this;
      this.userCallback = userCallback;
      this.srcLocations = srcLocations;
-      this.cleanup = () => {};
      this.generateHookOverrides(originalShader);
      this.output = {


Is this file still being used?

davepagurek · 2025-07-30T22:18:21Z

src/strands/ir_dag.js

+}
+
+export function getOrCreateNode(graph, node) {
+  // const key = getNodeKey(node);


haha I guess we're not getting, just creating? just double checking if these need to be uncommented

davepagurek · 2025-07-30T22:37:45Z

src/strands/strands_api.js

+        },
+        ...(hasDuplicates ? {} : {
+          set(value) {
+            return assignSwizzleNode(strandsContext, this, swizzle, value);


Is this defined?

davepagurek · 2025-07-30T22:47:52Z

src/strands/ir_types.js

+  [NodeType.VARIABLE]: ["identifier", "dimension", "baseType"],
+  [NodeType.CONSTANT]: ["value", "dimension", "baseType"],
+  [NodeType.STRUCT]: [""],
+  [NodeType.PHI]: ["dependsOn", "phiBlocks", "dimension", "baseType"],


What does this type represent?

davepagurek · 2025-07-30T22:49:01Z

src/strands/ir_types.js

+  [NodeType.OPERATION]: ["opCode", "dependsOn", "dimension", "baseType"],
+  [NodeType.LITERAL]: ["value", "dimension", "baseType"],
+  [NodeType.VARIABLE]: ["identifier", "dimension", "baseType"],
+  [NodeType.CONSTANT]: ["value", "dimension", "baseType"],


is this used currently, or just for the future?

davepagurek · 2025-07-30T23:01:40Z

src/strands/strands_glslBackend.js

+  [BlockType.DEFAULT]: (blockID, strandsContext, generationContext) => {
+    const { dag, cfg } = strandsContext;
+
+    const instructions = cfg.blockInstructions[blockID] || [];


Do these have to be sorted by DAG order to be valid? (Are these naturally stored in sorted order already?)

lukeplowden added 30 commits June 24, 2025 16:47

syntax/ remove unneccessary

23ff7e6

blocking out new modular strands structure

1511ffb

chipping away at DOD approach.

604c2dd

nested ifs

8950817

if/else semi working

f6369e7

change if/elseif/else api to be chainable and functional (return assi…

a355416

…gnments)

binary ops and contructors prototyped

3e1e149

simplify type system

f718717

SSA

24f0c46

Return type checking for hooks with native types reimplemented (i.e. …

0851285

…not p5 defined structs such as Vertex inputs)

declarations moved to backend, hook arguments fixed

9b84f6f

rename file

8509231

update api imports for new filename

47eda1a

move extractTypeInfo and rename to extractNodeTypeInfo

1088b4d

rename files for clarity

87e8a99

builtin function overloads type checking

e32fd47

function calls partially reimplemented. Still needs more error checking.

11a1610

update function calls to conform parameters when raw numbers are handed

e8f03d6

adding struct types

1ddd9a2

adding struct types

f3155e6

Merge branch 'strands-refactor' of github.com:lukeplowden/p5.js into …

babedfd

…strands-refactor

struct types working

afff707

comment old line. Should revisit structs if needs optimisation.

2e70e0e

fix wrong ID in binary op node

6d5913a

fix bug with binary op, and make strandsNode return node if arg is al…

2745bda

…ready a node.

fix function call bugs

4133fae

remove dag sort, use basic block instructions instead. Also start wor…

b3ce3ec

…k on swizzles

syntax/ remove unneccessary

9ebf77e

blocking out new modular strands structure

faae3aa

chipping away at DOD approach.

f6783d2

lukeplowden added 22 commits July 30, 2025 11:34

binary ops and contructors prototyped

627b7a3

simplify type system

7899f0d

SSA

b731c15

Return type checking for hooks with native types reimplemented (i.e. …

7166f35

…not p5 defined structs such as Vertex inputs)

declarations moved to backend, hook arguments fixed

e4e54ac

rename file

51e8ddd

update api imports for new filename

79c2f8d

move extractTypeInfo and rename to extractNodeTypeInfo

18dc1d3

rename files for clarity

eb5f1bf

builtin function overloads type checking

446d3ec

function calls partially reimplemented. Still needs more error checking.

83b4cf4

update function calls to conform parameters when raw numbers are handed

a743c68

adding struct types

295c140

adding struct types

7cd3d42

struct types working

f7b1339

comment old line. Should revisit structs if needs optimisation.

ba4be8b

fix wrong ID in binary op node

4fe4aaf

fix bug with binary op, and make strandsNode return node if arg is al…

0908e43

…ready a node.

fix function call bugs

5ce9451

remove dag sort, use basic block instructions instead. Also start wor…

54851ba

…k on swizzles

Merge branch 'strands-refactor' of github.com:lukeplowden/p5.js into …

2b681b8

…strands-refactor

change example

ebaaa08

lukeplowden added p5.js 2.0 p5.strands labels Jul 30, 2025

lukeplowden added this to the 2.1 milestone Jul 30, 2025

lukeplowden added this to p5.js 2.x 🌱🌳 Jul 30, 2025

lukeplowden marked this pull request as draft July 30, 2025 12:36

lukeplowden requested a review from davepagurek July 30, 2025 12:37

davepagurek reviewed Jul 30, 2025

View reviewed changes

ksen0 moved this to Open for Discussion in p5.js 2.x 🌱🌳 Jul 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[p5.strands] Significant refactor for p5.strands #8009

[p5.strands] Significant refactor for p5.strands #8009

Uh oh!

lukeplowden commented Jul 30, 2025 •

edited

Loading

Uh oh!

davepagurek left a comment •

edited

Loading

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

davepagurek Jul 30, 2025

Uh oh!

Uh oh!

	FES.internalError('Created a literal node with dimension > 1.')
	FES.internalError('Created a scalar literal node with dimension > 1.')

Uh oh!

[p5.strands] Significant refactor for p5.strands #8009

Are you sure you want to change the base?

[p5.strands] Significant refactor for p5.strands #8009

Uh oh!

Conversation

lukeplowden commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

Overview of the refactor

Entry point

User API

Stages of the compiler

1. Front-end: Transpile Stage

2. Middle-end: Building the Intermediate Representation (IR)

3. Back-end: Code generation

FES file

Next steps / input

PR Checklist

Uh oh!

davepagurek left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lukeplowden commented Jul 30, 2025 •

edited

Loading

davepagurek left a comment •

edited

Loading