Add a CrossEntropyError class. #93
Conversation
This PR gives us another way to evaluate how well predictions do against the actual known distribution. The iris example has been ported to demonstrate this method in practice. There is also a small refactoring of the local trainer's validate method, and some small refactors of other error classes.
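For readers unfamiliar with the metric, here is a minimal sketch of the computation being added (names like `CrossEntropySketch` are illustrative only, not the PR's API): cross entropy between the known label distribution p and the predicted distribution q is H(p, q) = -sum over x of p(x) * log2 q(x).

```scala
// Illustrative sketch, not Brushfire's implementation.
object CrossEntropySketch {
  // Turn raw label counts into a probability distribution.
  def normalize(counts: Map[String, Long]): Map[String, Double] = {
    val total = counts.values.sum.toDouble
    counts.map { case (k, v) => k -> (v / total) }
  }

  // H(p, q) = -sum_x p(x) * log2(q(x)); terms with p(x) == 0 contribute nothing.
  def crossEntropy(actual: Map[String, Double], predicted: Map[String, Double]): Double =
    actual.map { case (label, p) =>
      val q = predicted.getOrElse(label, 0.0)
      if (p == 0.0) 0.0 else -p * (math.log(q) / math.log(2.0))
    }.sum
}
```

When `predicted` equals `actual`, this reduces to the plain entropy of the actuals.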
Review by @tixxit, @avibryant, and/or @johnynek.

There are some problems here -- please wait to merge until I fix them (Travis should notice them too).
- def tee[A](fn: ((TypedPipe[Instance[K, V, T]], Sampler[K], Iterable[(Int, Tree[K, V, T])])) => Execution[A]): Trainer[K, V, T] = {
+ def tee(fn: ((TypedPipe[Instance[K, V, T]], Sampler[K], Iterable[(Int, Tree[K, V, T])])) => Execution[_]): Trainer[K, V, T] = {
Just in the interests of increasing my Scala knowledge: what difference does this make?
We talked about removing this, but I balked at fixing all the code that would have to be adjusted.
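For what it's worth, a toy sketch of the difference between the two signatures (`Execution` here is a stand-in trait, not Brushfire's): with `Execution[A]`, `tee` gains a type parameter it never actually uses, and every caller's result type must be threaded through it; with the existential `Execution[_]`, any result type fits without parameterizing `tee` itself.

```scala
// Simplified stand-ins, for illustration only.
trait Execution[+A]
case class Const[A](a: A) extends Execution[A]

object TeeSketch {
  // Before: A is a type parameter of tee, even though tee never uses it.
  def teeTyped[A](fn: Int => Execution[A]): Unit = { fn(1); () }

  // After: Execution[_] means "some Execution, result type unknown",
  // so callbacks with different result types all fit the same method.
  def teeWild(fn: Int => Execution[_]): Unit = { fn(1); () }
}
```

Both compile for the same call sites; the existential just avoids carrying a phantom type parameter on the `Trainer` side.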
  tree.targetFor(features)
}.toVector

error.create(instance.target, voter.combine(predictions))
I think this is going to give you the wrong answer when predictions is empty (e.g. because this instance is not in the validation set). Who knows what voter.combine(Vector.empty) is going to produce, but it's totally possible (even likely) that it will produce a non-zero error, which means we're accumulating all kinds of extra error for stuff that should have been filtered out.
I think we had convinced ourselves that there wouldn't be error in this case. But it's easy to add that special-case back in if need be.
I don't see how you can know that? There's no obvious law that Voter and Error need to conform to which would lead to

error.create(t, voter.combine(Vector.empty[T])) == error.monoid.zero
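One way to add that special case back, sketched here with toy stand-ins for Voter and Error (the real Brushfire types differ; the point is only the guard): skip instances with no predictions rather than trusting `voter.combine(Vector.empty)` to produce a zero error.

```scala
// Toy stand-ins, for illustration only.
object EmptyPredictionsSketch {
  // A toy "voter": average the predictions.
  def combine(predictions: Vector[Double]): Double =
    if (predictions.isEmpty) 0.0 else predictions.sum / predictions.size

  // A toy "error": absolute difference between actual and predicted.
  def create(actual: Double, predicted: Double): Double =
    math.abs(actual - predicted)

  val zero = 0.0 // the error monoid's identity

  // The guard: only accumulate error for instances that were actually scored.
  def contribution(actual: Double, predictions: Vector[Double]): Double =
    if (predictions.isEmpty) zero
    else create(actual, combine(predictions))
}
```

With the guard, an unscored instance contributes exactly the monoid zero instead of whatever `create(actual, combine(empty))` happens to return.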
val totalNormalized: Map[String, Double] = CrossEntropyError.normalize(totalData)
val totalEntropy: Double = CrossEntropyError.entropy(totalNormalized)

def relativeEntropy(xn: (Double, Long)): Double = {
this should really be called normalizedInformation or something (how much of the total mutual information have we learned).
After 10 runs we got to like 60% or something as I recall.
If relativeEntropy is the metric we really care about, can we build that into the CrossEntropyError? It seems like the only thing we need to keep track of to compute this is the total distribution of actuals, which would be easy to include in the error monoid.
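A sketch of that idea, with hypothetical names (`XEntState` is not in the PR): fold the actuals distribution into the error value itself, so the normalized-information "present" step needs no data from outside the monoid.

```scala
// Hypothetical error state: accumulated cross entropy, instance count,
// and the distribution of actual labels seen so far.
case class XEntState(sumXEnt: Double, count: Long, actuals: Map[String, Long]) {
  // The semigroup/monoid plus: everything is summed pointwise.
  def plus(that: XEntState): XEntState =
    XEntState(
      sumXEnt + that.sumXEnt,
      count + that.count,
      (actuals.keySet ++ that.actuals.keySet).map { k =>
        k -> (actuals.getOrElse(k, 0L) + that.actuals.getOrElse(k, 0L))
      }.toMap)

  // The "present" step: compare mean cross entropy against the entropy of
  // the actuals accumulated alongside it. 1.0 = perfect, 0.0 = learned nothing.
  def normalizedInformation: Double = {
    val total = actuals.values.sum.toDouble
    val entropy = actuals.values.map { c =>
      val p = c / total
      -p * (math.log(p) / math.log(2.0))
    }.sum
    1.0 - (sumXEnt / count) / entropy
  }
}
```

This stays a lawful semigroup (everything sums), and the final transformation to the metric you care about happens once, at the end.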
Having other people extend Brushfire really makes me feel the lack of having written tests. This makes me feel bad but is good for improving the code. Here's a law that I think we want to hold for all Errors:

error.semigroup.plus(error.create(a1, p), error.create(a2, p)) == error.create(Semigroup.plus(a1, a2), p)

Please note, this does not hold for predictions, that is:

error.semigroup.plus(error.create(a, p1), error.create(a, p2)) != error.create(a, Semigroup.plus(p1, p2))
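The law above can be checked directly. Here is a self-contained sketch against a toy cross-entropy error (names illustrative; a real test would use ScalaCheck rather than fixed inputs). The law holds here because the error sums -a(x) * log2 q(x) over unnormalized actual counts, making `create` linear in the actuals, which is exactly what the law demands.

```scala
// Toy model of the Error law, for illustration only.
object ErrorLawSketch {
  type Actuals = Map[String, Long]
  case class Err(total: Double, weight: Long)

  // The error semigroup: sum both components.
  def plus(e1: Err, e2: Err): Err = Err(e1.total + e2.total, e1.weight + e2.weight)

  // create is linear in the actual counts, so creating from summed
  // actuals equals summing the created errors (the law above).
  def create(actual: Actuals, predicted: Map[String, Double]): Err = {
    val total = actual.map { case (label, n) =>
      -n * (math.log(predicted.getOrElse(label, 1e-9)) / math.log(2.0))
    }.sum
    Err(total, actual.values.sum)
  }

  // Semigroup.plus on actuals: merge the count maps.
  def sumActuals(a1: Actuals, a2: Actuals): Actuals =
    (a1.keySet ++ a2.keySet).map { k =>
      k -> (a1.getOrElse(k, 0L) + a2.getOrElse(k, 0L))
    }.toMap
}
```

The corresponding law over predictions fails for the same reason: cross entropy is not linear in q.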
  Instance(line, 0L, Map(cols.zip(values): _*), Map(label -> 1L))
}.toIterable

val totalData: Map[String, Long] = Monoid.sum(trainingData.iterator.map(_.target))
It seems wrong to me that we are using the full training set, in any way, in an error computation; I feel like we should only be basing the error computation on the validation set.
yeah, that's fine. We just need an estimate of the "true" entropy. We should use the exact same set that we are measuring the error on below.
I guess we could fit it into the semigroup approach by aggregating the total label distribution as we go. (In that light, by the way, it seems like Error should really be a semigroup and a function, maybe a full aggregator.)
Agreed that we should do that (that's what I meant in https://github.com/stripe/brushfire/pull/93/files/6aa9b7fe7adaaf9dbbce7f79abd767e56c4f2ecd#r68490840). I take your point about Error needing a present function. Error[T,P,E] could equally be Aggregator[(T,P),E,E2] where E2 is the thing you actually care about. Or you could decide you just cared about having an Ordering[E] (which comes up a lot in practice) and which would do the transformation to Double or whatever internally.
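A sketch of that Aggregator shape (simplified; Algebird's Aggregator is the real-world analogue with the same prepare/reduce/present structure): `create` becomes `prepare`, the semigroup plus becomes `reduce`, and the E => E2 transformation is `present`.

```scala
// Sketch of Error-as-Aggregator, for illustration only.
trait ErrorAggregator[T, P, E, E2] {
  def prepare(input: (T, P)): E // Error.create
  def reduce(l: E, r: E): E     // Error.semigroup.plus
  def present(e: E): E2         // the final transformation discussed above

  // Run the whole pipeline over a non-empty collection of (actual, predicted) pairs.
  def apply(inputs: Iterable[(T, P)]): E2 =
    present(inputs.iterator.map(prepare).reduce(reduce))
}
```

For instance, a mean-absolute-error instance accumulates (sum, count) as E and presents their ratio as E2, keeping the accumulated state a semigroup while the metric you actually report lives in `present`.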