cover

Static to Stories

How do AI image tools make pictures?
You type "a cat wearing sunglasses on a skateboard" and click a button. ~~Three seconds later:~~ there it is, **a perfec

You type "a cat wearing sunglasses on a skateboard" and click a button. Three seconds later: there it is, a perfect picture that never existed before. No camera. No paintbrush. How does the computer do that?

The AI doesn't "see" pictures the way you do. To the computer, every image is just **millions of tiny numbers**—*one num

The AI doesn't "see" pictures the way you do. To the computer, every image is just millions of tiny numbersone number for how red each pixel is, one for how green, one for how blue. A photograph of a sunset? To the AI, it's a spreadsheet with 2 million cells.

~~Before it can make pictures~~, the AI has to learn what things look like. Humans feed it *millions of images*—cats, sk

Before it can make pictures, the AI has to learn what things look like. Humans feed it millions of images—cats, skateboards, mountains, robots, everything—each one labeled with words describing what's in it. The AI studies the number patterns: "Oh, pictures labeled 'cat' have these clusters of numbers. Pictures labeled 'skateboard' have those patterns."

~~But here's the weird part:~~ the AI learns by starting with **pure chaos**. Imagine a TV showing nothing but static—_r

But here's the weird part: the AI learns by starting with pure chaos. Imagine a TV showing nothing but static—random colored pixels with no picture at all. The AI's job is to learn how to turn that static into a real image, one tiny step at a time.

During training, the AI practices **millions of times**. Humans show it a real photo, then add static to it—_a little, t

During training, the AI practices millions of times. Humans show it a real photo, then add static to it—a little, then more, then so much the photo disappears completely into noise. The AI learns to run that process backward: "If I see this much noise, I can clean it up to look like this. If I see that pattern, I can sharpen it into a cat."

~~Now here's where your words come in.~~ When you type "a cat wearing sunglasses on a skateboard," the AI doesn't search

Now here's where your words come in. When you type "a cat wearing sunglasses on a skateboard," the AI doesn't search a database of pre-made pictures. Instead, it uses a part of its brain called the text encoderthink of it as a translator that turns your sentence into a special code, a mathematical recipe that means "catness + sunglasses + skateboard + cool pose."

The AI starts with random static and uses your recipe to guide the cleanup. It makes a tiny improvement: ~~"This corner

The AI starts with random static and uses your recipe to guide the cleanup. It makes a tiny improvement: "This corner should probably be orange, cat fur is often orange." Then another: "This blob should have edges, skateboards have edges." Then another, and another—maybe fifty steps total, each one nudging the noise closer to your description.

It's checking its work the whole time, asking "**Does this look like the patterns I learned** for 'cat'? For 'sunglasses

It's checking its work the whole time, asking "Does this look like the patterns I learned for 'cat'? For 'sunglasses'? For 'skateboard'?" When all fifty steps are done, the static has become a brand-new picture—one that never existed, but matches all the patterns the AI learned from millions of real images.

~~The cat isn't copied from any photograph.~~ ~~The sunglasses aren't traced from a drawing.~~ The AI invented them, _th

The cat isn't copied from any photograph. The sunglasses aren't traced from a drawing. The AI invented them, the same way you might dream up a new combination of things you've seen before—except the AI does it with math, turning numbers into pixels, static into stories, recipes into pictures.

How was this book?

A Wonderleaf Book

Static to Stories

— How do AI image tools make pictures? —

Wonderleaf Editions
— ex libris —
A Wonderleaf Book

Static to Stories

How do AI image tools make pictures?

Wonderleaf Editions · MMXXVI
Scene 1
You type "a cat wearing sunglasses on a skateboard" and click a button. ~~Three seconds later:~~ there it is, **a perfec
Static to Stories2
Scene 1

You type "a cat wearing sunglasses on a skateboard" and click a button. Three seconds later: there it is, a perfect picture that never existed before. No camera. No paintbrush. How does the computer do that?

3Static to Stories
Scene 2
The AI doesn't "see" pictures the way you do. To the computer, every image is just **millions of tiny numbers**—*one num
Static to Stories4
Scene 2

The AI doesn't "see" pictures the way you do. To the computer, every image is just millions of tiny numbersone number for how red each pixel is, one for how green, one for how blue. A photograph of a sunset? To the AI, it's a spreadsheet with 2 million cells.

5Static to Stories
Scene 3
~~Before it can make pictures~~, the AI has to learn what things look like. Humans feed it *millions of images*—cats, sk
Static to Stories6
Scene 3

Before it can make pictures, the AI has to learn what things look like. Humans feed it millions of images—cats, skateboards, mountains, robots, everything—each one labeled with words describing what's in it. The AI studies the number patterns: "Oh, pictures labeled 'cat' have these clusters of numbers. Pictures labeled 'skateboard' have those patterns."

7Static to Stories
Scene 4
~~But here's the weird part:~~ the AI learns by starting with **pure chaos**. Imagine a TV showing nothing but static—_r
Static to Stories8
Scene 4

But here's the weird part: the AI learns by starting with pure chaos. Imagine a TV showing nothing but static—random colored pixels with no picture at all. The AI's job is to learn how to turn that static into a real image, one tiny step at a time.

9Static to Stories
Scene 5
During training, the AI practices **millions of times**. Humans show it a real photo, then add static to it—_a little, t
Static to Stories10
Scene 5

During training, the AI practices millions of times. Humans show it a real photo, then add static to it—a little, then more, then so much the photo disappears completely into noise. The AI learns to run that process backward: "If I see this much noise, I can clean it up to look like this. If I see that pattern, I can sharpen it into a cat."

11Static to Stories
Scene 6
~~Now here's where your words come in.~~ When you type "a cat wearing sunglasses on a skateboard," the AI doesn't search
Static to Stories12
Scene 6

Now here's where your words come in. When you type "a cat wearing sunglasses on a skateboard," the AI doesn't search a database of pre-made pictures. Instead, it uses a part of its brain called the text encoderthink of it as a translator that turns your sentence into a special code, a mathematical recipe that means "catness + sunglasses + skateboard + cool pose."

13Static to Stories
Scene 7
The AI starts with random static and uses your recipe to guide the cleanup. It makes a tiny improvement: ~~"This corner
Static to Stories14
Scene 7

The AI starts with random static and uses your recipe to guide the cleanup. It makes a tiny improvement: "This corner should probably be orange, cat fur is often orange." Then another: "This blob should have edges, skateboards have edges." Then another, and another—maybe fifty steps total, each one nudging the noise closer to your description.

15Static to Stories
Scene 8
It's checking its work the whole time, asking "**Does this look like the patterns I learned** for 'cat'? For 'sunglasses
Static to Stories16
Scene 8

It's checking its work the whole time, asking "Does this look like the patterns I learned for 'cat'? For 'sunglasses'? For 'skateboard'?" When all fifty steps are done, the static has become a brand-new picture—one that never existed, but matches all the patterns the AI learned from millions of real images.

17Static to Stories
Scene 9
~~The cat isn't copied from any photograph.~~ ~~The sunglasses aren't traced from a drawing.~~ The AI invented them, _th
Static to Stories18
Scene 9

The cat isn't copied from any photograph. The sunglasses aren't traced from a drawing. The AI invented them, the same way you might dream up a new combination of things you've seen before—except the AI does it with math, turning numbers into pixels, static into stories, recipes into pictures.

19Static to Stories

~ finis ~

Tiny picture books for big little questions.

— a small constellation of questions —
Wonderleaf
Editions