Monday, Jan 1, 0001 | 3 minute read | Updated at Monday, Jan 1, 0001

@

扩散模型完整数学

Layout

Three-row horizontal layout: Row 1 shows “前向过程 Forward” (x₀→x_T with noise increasing), Row 2 shows “反向过程 Reverse” (x_T→x₀ with denoising), Row 3 shows “CFG Classifier-Free Guidance” formula and intuition. A large central vertical arrow connects Row 1 and Row 2. Title “扩散模型” at top.

ZONES

  • Title Zone: Top center, “扩散模型 Diffusion” in large handwritten font with a star ★, small noise dots decoration
  • Row 1 - Forward Process: Left to right flow
    • x₀ (clean image icon) → progressively noisier icons → x_T (pure noise icon)
    • Key formula displayed prominently below the flow
    • Show α_t, ᾱ_t notation
  • Row 2 - Reverse Process: Right to left flow (opposite direction)
    • x_T (noise) → progressively cleaner icons → x₀ (clean)
    • Loss function formula displayed prominently
    • DDPM sampling algorithm in a small code-like box
    • DDIM note as a side tag
  • Row 3 - CFG: Two parallel paths merging
    • Left path: ε_unc (unconditional prediction)
    • Right path: ε_cond (conditional prediction)
    • Merging point: CFG formula with guidance scale s
    • Visual: two arrows combining into one stronger arrow
  • Right Side Panel - 3D Advantages: Small vertical list of why diffusion beats GAN for 3D

LABELS

  • Forward Process: “前向加噪 Forward”, “q(x_t|x_{t-1}) = N(√α_t·x_{t-1}, β_t·I)”, “β_t: 10⁻⁴ → 0.02 线性调度”, “★核心公式: x_t = √ᾱ_t·x₀ + √(1-ᾱ_t)·ε”, “α_t = 1-β_t”, “ᾱ_t = ∏α_s”, “ε ~ N(0,I)”, “归纳法证明: 方差相加”
  • Reverse Process: “反向去噪 Reverse”, “L = E[‖ε - ε_θ(x_t, t)‖²]”, “预测噪声ε而非x₀ (优化landscape更平滑)”, “DDPM: x_{t-1} = (1/√α_t)(x_t - (1-α_t)/√(1-ᾱ_t)·ε) + √β_t·z”, “DDIM: 非马尔可夫 10~50步加速”
  • CFG: “Classifier-Free Guidance”, “ε̂ = ε_unc + s·(ε_cond - ε_unc)”, “s=1: 普通条件采样”, “s>1: 增强条件对齐 牺牲多样性”, “10%概率丢弃条件(训练时)”
  • 3D Panel: “3D优势”, “① MSE损失 训练稳定”, “② 分数函数 指向数据流形”, “③ CFG灵活条件化”, “④ 多尺度学习 粗→细”

COLORS

  • Background: Warm Cream (#F5F0E8)
  • Forward row: very light Macaron Blue (#A8D8EA) background strip
  • Reverse row: very light Peach (#FFD5C2) background strip
  • CFG row: very light Lavender (#D5C6E0) background strip
  • Clean data icons (x₀): Macaron Blue (#A8D8EA) fill
  • Noisy data icons (x_T): grey with scattered dots
  • Central connecting arrow: Mustard Yellow (#F2CC8F)
  • Core formula box: Peach (#FFD5C2) border, large Coral Red (#E8655A) text
  • Loss function: Coral Red (#E8655A)
  • DDPM code box: Mint (#B5E5CF) background
  • CFG formula: Coral Red (#E8655A) in large font
  • ε_unc path: Lavender (#D5C6E0)
  • ε_cond path: Macaron Blue (#A8D8EA)
  • Merged arrow: Coral Red (#E8655A) bold
  • 3D advantages panel: Mint (#B5E5CF) background with small checkmark icons

STYLE

Flat vector illustration. Clean black outlines on all elements. Progressive noise shown as icons going from clear geometric shapes to random scattered dots. Arrows are clean with arrowheads. Code-like box uses monospace-style font. Star ★ next to core formula. Small decorative noise patterns. No gradients. Keywords in bold handwritten-style font, formulas in clean sans-serif at large size.

ASPECT

3:4 portrait (tall format to accommodate three rows of content)

Clean composition with generous white space. Simple or no background. Main elements centered or positioned by content needs. Color values (#hex) and color names are rendering guidance only — do NOT display color names, hex codes, or palette labels as visible text in the image. Text should be large and prominent with handwritten-style fonts. Keep minimal, focus on keywords. Language: Chinese.

© 2021 - 2026 古月月仔的博客

🌱 Powered by Hugo with theme Dream.

关于我
  • 我是古月月仔
  • Shimizu Tou||Ethan Hu
  • 分享技术学习笔记与生活感悟杂谈
  • 现居: 上海 中国
  • 家乡: 平遥 山西
日常效率工具
  • 📝Typora — Markdown极简编辑器。
  • 📓Notion — 一站式笔记工作空间。
  • 🔗N8N — 强大的节点编排工作流工具。
  • 🤖Gemini — 好用的智能 AI 助手。
  • IamgetoUrl - 快速将图片转为URL。
  • Anyconv - 多种文件格式转换在线工具。
  • PDF24 - PDF编辑操作在线工具箱。
内容与资源工具
  • Mixamo - Adobe提供的免费角色动画库。
  • Bunlock-Music - 用于解密.ncm,.qmc类音频格式的工具。
  • EasyGIT - 一款在线GIF动图制作工具。
  • Alphacoders - 海量高清壁纸和影视/游戏原图社区。
  • 哲风壁纸 -中文壁纸分享与交流社区。
  • 爱给网 - 免费音效、配乐、视频模板等海量媒体素材。
  • 模之屋 - 专注于分享各类3D模型的社区。
我的爱好
  • 🚀 喜欢折腾各种好玩的应用技术
  • 📸 业余摄影爱好者
  • 🎮 各类游戏玩家
  • 💻 数码产品折腾爱好者
  • 📚 正在阅读:《人类简史》
  • 🎬 经典重温:《爱乐之城》