nn_zero_to_hero

why does the loss in makemore2.jl bottom out at 4? the same code written with torch and Flux consistently hits ~2.5
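
one sanity check that might help narrow this down, as a rough sketch only: compare the plateau against the loss of a model that guesses uniformly. The snippet assumes a 27-symbol vocabulary (a-z plus one boundary token) and that the reported loss is a mean negative log-likelihood in nats; neither is confirmed by these notes, and `uniform_loss` and `mean_nll` are hypothetical helper names, not functions from makemore2.jl.

```julia
# Sketch: baseline loss of a uniform guess over a vocabulary of size V.
# Assumption: V = 27 (a-z plus one boundary token); change it to the real vocab size.
uniform_loss(V::Integer) = log(V)   # mean negative log-likelihood, in nats

# Mean NLL of the probabilities assigned to the true targets.
# probs:   V×N matrix, each column a probability distribution over the vocab
# targets: length-N vector of 1-based class indices
function mean_nll(probs::AbstractMatrix, targets::AbstractVector{<:Integer})
    n = length(targets)
    return -sum(log(probs[targets[i], i]) for i in 1:n) / n
end

V = 27
probs = fill(1 / V, V, 5)        # a "model" that always predicts uniformly
targets = [1, 5, 27, 3, 10]      # arbitrary example targets

println(uniform_loss(V))             # about 3.296
println(mean_nll(probs, targets))    # matches the uniform baseline
```

under those assumptions a plateau near 4 would sit above the uniform baseline, which would hint at something other than undertraining (for example how the loss is normalized or reduced). That is a guess to check against the actual code, not a diagnosis.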

todo: build a bigram word model on the concatenated YouTube transcripts

.gitignore
Project.toml
README.md
attention.ipynb
attention.jl
bigram_of_me.jl
flux_mlp.jl
flux_mlp_clean.jl
makemore.ipynb
makemore.jl
makemore2.ipynb
makemore2.jl
micrograd.ipynb
mlp_of_me.jl
names.txt
