Adding Hyvarinen Score Matching Score for the normal distribution #396

zsk2002 · 2025-09-29T23:38:27Z

In this pull request, I added the Hyvarinen Score Matching Score based on the paper http://jmlr.org/papers/v6/hyvarinen05a.html. The performance in terms of point estimation and negative log likelihood is very close to the logScore presented in the paper.

Copilot

Pull Request Overview

This PR implements the Hyvarinen Score Matching Score for the normal distribution in NGBoost, based on the referenced JMLR paper. The implementation provides an alternative scoring method that reportedly achieves similar performance to the existing log score.

Key changes:

Added a generic ScoreMatchingScore base class in the scores module
Implemented NormalScoreMatchingScore with score computation, derivatives, and Fisher information matrix
Integrated the new scoring method into the Normal distribution's available scores

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
ngboost/scores.py	Adds generic ScoreMatchingScore base class with reference documentation
ngboost/distns/normal.py	Implements NormalScoreMatchingScore and adds it to Normal distribution's score options

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

Copilot · 2025-10-01T16:42:55Z

ngboost/distns/normal.py

+    def score(self, Y):
+        loc = self.loc
+        var = self.var
+        var = var


This line is redundant - var is assigned to itself without any transformation. This should be removed.

Suggested change

var = var

Copilot · 2025-10-01T16:42:55Z

ngboost/distns/normal.py

+        n = len(Y)
+        D = np.zeros((len(Y), 2))
+        D[:, 0] =  (self.loc -Y)/(self.var **2)
+        D[:, 1] = 2/self.var - (2*(Y - self.loc)**2)/(self.var**2)


[nitpick] The expression (2*(Y - self.loc)**2)/(self.var**2) can be simplified to 2*(Y - self.loc)**2/self.var**2 for better readability.

Suggested change

D[:, 1] = 2/self.var - (2*(Y - self.loc)**2)/(self.var**2)

D[:, 1] = 2/self.var - 2*(Y - self.loc)**2/self.var**2

Copilot · 2025-10-01T16:42:56Z

ngboost/distns/normal.py

+        loc = self.loc
+        var = self.var
+        var = var
+        return ((Y - loc)**2 / (2* var**2)) - 1/var


[nitpick] Inconsistent spacing around the multiplication operator. Should be 2 * var**2 instead of 2* var**2.

Suggested change

return ((Y - loc)**2 / (2* var**2)) - 1/var

return ((Y - loc)**2 / (2 * var**2)) - 1/var

Copilot · 2025-10-01T16:42:56Z

ngboost/distns/normal.py

+    def d_score(self, Y):
+        n = len(Y)
+        D = np.zeros((len(Y), 2))
+        D[:, 0] =  (self.loc -Y)/(self.var **2)


[nitpick] Inconsistent spacing: extra space after = and missing space before -Y. Should be D[:, 0] = (self.loc - Y)/(self.var**2).

Suggested change

D[:, 0] = (self.loc -Y)/(self.var **2)

D[:, 0] = (self.loc - Y) / (self.var**2)

Copilot · 2025-10-01T16:42:56Z

ngboost/distns/normal.py

+        return ((Y - loc)**2 / (2* var**2)) - 1/var
+
+    def d_score(self, Y):
+        n = len(Y)


The variable n is defined but never used in the function. This line should be removed.

Suggested change

n = len(Y)

ryan-wolbeck · 2025-10-04T12:55:33Z

Please run make lint and make test in your environment to fix the linting issues causing the build to fail

zsk2002 added 3 commits September 29, 2025 18:29

Add Score Matching Score for Normal distributin

0fd95b9

Add a line

1dd28dc

Match pep8 style

0926f72

zsk2002 closed this Sep 29, 2025

zsk2002 reopened this Sep 29, 2025

alejandroschuler requested review from Copilot and ryan-wolbeck October 1, 2025 16:41

Copilot AI reviewed Oct 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Adding Hyvarinen Score Matching Score for the normal distribution #396

Adding Hyvarinen Score Matching Score for the normal distribution #396

Uh oh!

zsk2002 commented Sep 29, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

Copilot AI Oct 1, 2025

Uh oh!

ryan-wolbeck commented Oct 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	D[:, 1] = 2/self.var - (2(Y - self.loc)2)/(self.var*2)
	D[:, 1] = 2/self.var - 2(Y - self.loc)2/self.var*2

	return ((Y - loc)*2 / (2 var**2)) - 1/var
	return ((Y - loc)*2 / (2 var**2)) - 1/var

	D[:, 0] = (self.loc -Y)/(self.var **2)
	D[:, 0] = (self.loc - Y) / (self.var**2)

Uh oh!

Adding Hyvarinen Score Matching Score for the normal distribution #396

Are you sure you want to change the base?

Adding Hyvarinen Score Matching Score for the normal distribution #396

Uh oh!

Conversation

zsk2002 commented Sep 29, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 1, 2025

Choose a reason for hiding this comment

Uh oh!

ryan-wolbeck commented Oct 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants