Tag Archives: silly

GPT-175bee

Epistemic status: whimsical

Bees: a new unit of measurement for ML model size

Talking about modern ML models inevitably leads to a bunch of hard-to-intuit large numbers, especially when it comes to parameter count.

To address this, Lawrence Chan and I propose that we adopt a new, human-friendly unit to measure the number of learnable parameters in an architecture:

1 beepower = 1 BP = 1 billion parameters

Read the rest of this post on LessWrong.

Fun math facts about 2023

2023=7×172

Maybe that’s not fun enough? Try this:

2023=211−52

Or better yet:

20233=(31176029+245568392)/(384321573)

We can scientifically quantify how fun a math fact is, so we can rest assured that this is the funnest fact about 2023 ever discovered.

But if it’s not to your liking:

2023=(21034−1)/41

2023=(5511+24)/17

2023=(3647+27)/17

2023=(24792−36)/72=(2279/7)2−(33/7)2

Happy New Year!