首页下载资源人工智能大模型实战教程,从0手撸LLM

ZIP大模型实战教程,从0手撸LLM

u01381840642.65MB需要积分:1

资源文件列表:

llms-from-scratch-cn-main.zip 大约有1112个文件
  1. llms-from-scratch-cn-main/
  2. __MACOSX/._llms-from-scratch-cn-main 212B
  3. llms-from-scratch-cn-main/Translated_Book/
  4. __MACOSX/llms-from-scratch-cn-main/._Translated_Book 212B
  5. llms-from-scratch-cn-main/images/
  6. __MACOSX/llms-from-scratch-cn-main/._images 212B
  7. llms-from-scratch-cn-main/README.md 12.86KB
  8. __MACOSX/llms-from-scratch-cn-main/._README.md 212B
  9. llms-from-scratch-cn-main/Model_Architecture_Discussions/
  10. __MACOSX/llms-from-scratch-cn-main/._Model_Architecture_Discussions 212B
  11. llms-from-scratch-cn-main/.gitignore 3.22KB
  12. __MACOSX/llms-from-scratch-cn-main/._.gitignore 212B
  13. llms-from-scratch-cn-main/Book/
  14. __MACOSX/llms-from-scratch-cn-main/._Book 212B
  15. llms-from-scratch-cn-main/Codes/
  16. __MACOSX/llms-from-scratch-cn-main/._Codes 212B
  17. llms-from-scratch-cn-main/LICENSE.txt 1.02KB
  18. __MACOSX/llms-from-scratch-cn-main/._LICENSE.txt 212B
  19. llms-from-scratch-cn-main/Translated_Book/ch01/
  20. __MACOSX/llms-from-scratch-cn-main/Translated_Book/._ch01 212B
  21. llms-from-scratch-cn-main/Translated_Book/ch04/
  22. __MACOSX/llms-from-scratch-cn-main/Translated_Book/._ch04 212B
  23. llms-from-scratch-cn-main/Translated_Book/ch03/
  24. __MACOSX/llms-from-scratch-cn-main/Translated_Book/._ch03 212B
  25. llms-from-scratch-cn-main/Translated_Book/img/
  26. __MACOSX/llms-from-scratch-cn-main/Translated_Book/._img 212B
  27. llms-from-scratch-cn-main/Translated_Book/ch02/
  28. __MACOSX/llms-from-scratch-cn-main/Translated_Book/._ch02 212B
  29. llms-from-scratch-cn-main/Translated_Book/ch05/
  30. __MACOSX/llms-from-scratch-cn-main/Translated_Book/._ch05 212B
  31. llms-from-scratch-cn-main/images/mental-model.jpg 173.65KB
  32. __MACOSX/llms-from-scratch-cn-main/images/._mental-model.jpg 212B
  33. llms-from-scratch-cn-main/images/cover.jpg 47.2KB
  34. __MACOSX/llms-from-scratch-cn-main/images/._cover.jpg 212B
  35. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/
  36. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._llama3 212B
  37. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/
  38. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._phi-3 212B
  39. llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/
  40. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._olmo 212B
  41. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/
  42. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._MiniCPM 212B
  43. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v1/
  44. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-v1 212B
  45. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/
  46. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-v6 212B
  47. llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/
  48. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._pangu 212B
  49. llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/
  50. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._mamba 212B
  51. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/
  52. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-compare 212B
  53. llms-from-scratch-cn-main/Model_Architecture_Discussions/.keep
  54. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._.keep 212B
  55. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/
  56. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._ChatGLM4 212B
  57. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/
  58. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._ChatGLM3 212B
  59. llms-from-scratch-cn-main/Model_Architecture_Discussions/img/
  60. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._img 212B
  61. llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/
  62. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._openelm 212B
  63. llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/
  64. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._gptj 212B
  65. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/
  66. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-v3 212B
  67. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v4/
  68. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-v4 212B
  69. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/
  70. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-v5 212B
  71. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/
  72. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._rwkv-v2 212B
  73. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/
  74. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/._phi 212B
  75. llms-from-scratch-cn-main/Book/ch06/
  76. __MACOSX/llms-from-scratch-cn-main/Book/._ch06 212B
  77. llms-from-scratch-cn-main/Book/ch01/
  78. __MACOSX/llms-from-scratch-cn-main/Book/._ch01 212B
  79. llms-from-scratch-cn-main/Book/ch04/
  80. __MACOSX/llms-from-scratch-cn-main/Book/._ch04 212B
  81. llms-from-scratch-cn-main/Book/ch03/
  82. __MACOSX/llms-from-scratch-cn-main/Book/._ch03 212B
  83. llms-from-scratch-cn-main/Book/ch02/
  84. __MACOSX/llms-from-scratch-cn-main/Book/._ch02 212B
  85. llms-from-scratch-cn-main/Book/ch05/
  86. __MACOSX/llms-from-scratch-cn-main/Book/._ch05 212B
  87. llms-from-scratch-cn-main/Codes/ch07/
  88. __MACOSX/llms-from-scratch-cn-main/Codes/._ch07 212B
  89. llms-from-scratch-cn-main/Codes/ch06/
  90. __MACOSX/llms-from-scratch-cn-main/Codes/._ch06 212B
  91. llms-from-scratch-cn-main/Codes/ch01/
  92. __MACOSX/llms-from-scratch-cn-main/Codes/._ch01 212B
  93. llms-from-scratch-cn-main/Codes/appendix-B/
  94. __MACOSX/llms-from-scratch-cn-main/Codes/._appendix-B 212B
  95. llms-from-scratch-cn-main/Codes/ch04/
  96. __MACOSX/llms-from-scratch-cn-main/Codes/._ch04 212B
  97. llms-from-scratch-cn-main/Codes/ch03/
  98. __MACOSX/llms-from-scratch-cn-main/Codes/._ch03 212B
  99. llms-from-scratch-cn-main/Codes/ch02/
  100. __MACOSX/llms-from-scratch-cn-main/Codes/._ch02 212B
  101. llms-from-scratch-cn-main/Codes/ch05/
  102. __MACOSX/llms-from-scratch-cn-main/Codes/._ch05 212B
  103. llms-from-scratch-cn-main/Codes/appendix-A/
  104. __MACOSX/llms-from-scratch-cn-main/Codes/._appendix-A 212B
  105. llms-from-scratch-cn-main/Translated_Book/ch01/1.1什么是LLM.md 20.69KB
  106. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.1什么是LLM.md 212B
  107. llms-from-scratch-cn-main/Translated_Book/ch01/1.0理解大型语言模型.md 14.14KB
  108. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.0理解大型语言模型.md 212B
  109. llms-from-scratch-cn-main/Translated_Book/ch01/1.8总结.ipynb 2.63KB
  110. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.8总结.ipynb 212B
  111. llms-from-scratch-cn-main/Translated_Book/ch01/.keep
  112. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._.keep 212B
  113. llms-from-scratch-cn-main/Translated_Book/ch01/1.6深入剖析GPT架构.ipynb 5.44KB
  114. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.6深入剖析GPT架构.ipynb 212B
  115. llms-from-scratch-cn-main/Translated_Book/ch01/1.7构建大语言模型.ipynb 2.47KB
  116. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.7构建大语言模型.ipynb 212B
  117. llms-from-scratch-cn-main/Translated_Book/ch01/1.2LLMs的应用.md 5.15KB
  118. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.2LLMs的应用.md 212B
  119. llms-from-scratch-cn-main/Translated_Book/ch01/1.5利用大型数据集.ipynb 5.59KB
  120. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._1.5利用大型数据集.ipynb 212B
  121. llms-from-scratch-cn-main/Translated_Book/ch01/welcome.ipynb 21.17KB
  122. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch01/._welcome.ipynb 212B
  123. llms-from-scratch-cn-main/Translated_Book/ch04/4.7 生成文本.ipynb 6.47KB
  124. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.7 生成文本.ipynb 212B
  125. llms-from-scratch-cn-main/Translated_Book/ch04/4.5 在transfomer模块中连接注意力层和线性层.ipynb 12.37KB
  126. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.5 在transfomer模块中连接注意力层和线性层.ipynb 212B
  127. llms-from-scratch-cn-main/Translated_Book/ch04/.keep
  128. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._.keep 212B
  129. llms-from-scratch-cn-main/Translated_Book/ch04/4.6 编码GPT模型.ipynb 15.42KB
  130. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.6 编码GPT模型.ipynb 212B
  131. llms-from-scratch-cn-main/Translated_Book/ch04/4.1.ipynb 17.16KB
  132. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.1.ipynb 212B
  133. llms-from-scratch-cn-main/Translated_Book/ch04/4.3 实现使用 GELU 激活函数的前馈网络.ipynb 56.12KB
  134. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.3 实现使用 GELU 激活函数的前馈网络.ipynb 212B
  135. llms-from-scratch-cn-main/Translated_Book/ch04/4.2 使用层归一化对激活进行归一化.ipynb 15.38KB
  136. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.2 使用层归一化对激活进行归一化.ipynb 212B
  137. llms-from-scratch-cn-main/Translated_Book/ch04/4.2.ipynb 15.38KB
  138. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.2.ipynb 212B
  139. llms-from-scratch-cn-main/Translated_Book/ch04/4.4 增加快捷链接.ipynb 11.6KB
  140. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.4 增加快捷链接.ipynb 212B
  141. llms-from-scratch-cn-main/Translated_Book/ch04/4.1 从头开始实现 GPT 模型以生成文本.ipynb 17.16KB
  142. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.1 从头开始实现 GPT 模型以生成文本.ipynb 212B
  143. llms-from-scratch-cn-main/Translated_Book/ch04/4.6 编码GPT模型-Copy1.ipynb 15.42KB
  144. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch04/._4.6 编码GPT模型-Copy1.ipynb 212B
  145. llms-from-scratch-cn-main/Translated_Book/ch03/3.1.ipynb 9.1KB
  146. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.1.ipynb 212B
  147. llms-from-scratch-cn-main/Translated_Book/ch03/3.3.ipynb 25.13KB
  148. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.3.ipynb 212B
  149. llms-from-scratch-cn-main/Translated_Book/ch03/3.7.ipynb 2.4KB
  150. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.7.ipynb 212B
  151. llms-from-scratch-cn-main/Translated_Book/ch03/3.5.ipynb 27.44KB
  152. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.5.ipynb 212B
  153. llms-from-scratch-cn-main/Translated_Book/ch03/3.2.ipynb 3.82KB
  154. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.2.ipynb 212B
  155. llms-from-scratch-cn-main/Translated_Book/ch03/3.4.ipynb 25.56KB
  156. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.4.ipynb 212B
  157. llms-from-scratch-cn-main/Translated_Book/ch03/.keep
  158. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._.keep 212B
  159. llms-from-scratch-cn-main/Translated_Book/ch03/3.6.ipynb 23.54KB
  160. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch03/._3.6.ipynb 212B
  161. llms-from-scratch-cn-main/Translated_Book/img/fig-A-1.jpg 94.35KB
  162. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-1.jpg 212B
  163. llms-from-scratch-cn-main/Translated_Book/img/fig-4-12.jpg 138.23KB
  164. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-12.jpg 212B
  165. llms-from-scratch-cn-main/Translated_Book/img/fig-4-13.jpg 167.87KB
  166. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-13.jpg 212B
  167. llms-from-scratch-cn-main/Translated_Book/img/fig-3-26.jpg 115.67KB
  168. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-26.jpg 212B
  169. llms-from-scratch-cn-main/Translated_Book/img/fig-4-9.jpg 129.5KB
  170. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-9.jpg 212B
  171. llms-from-scratch-cn-main/Translated_Book/img/fig-A-2.jpg 131.44KB
  172. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-2.jpg 212B
  173. llms-from-scratch-cn-main/Translated_Book/img/fig-3-24.jpg 128.16KB
  174. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-24.jpg 212B
  175. llms-from-scratch-cn-main/Translated_Book/img/fig-3-18.jpg 196.89KB
  176. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-18.jpg 212B
  177. llms-from-scratch-cn-main/Translated_Book/img/fig-4-11.jpg 93.37KB
  178. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-11.jpg 212B
  179. llms-from-scratch-cn-main/Translated_Book/img/fig-3-19.jpg 126.72KB
  180. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-19.jpg 212B
  181. llms-from-scratch-cn-main/Translated_Book/img/fig-4-10.jpg 167.49KB
  182. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-10.jpg 212B
  183. llms-from-scratch-cn-main/Translated_Book/img/fig-3-25.jpg 83.88KB
  184. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-25.jpg 212B
  185. llms-from-scratch-cn-main/Translated_Book/img/fig-A-3.jpg 104.26KB
  186. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-3.jpg 212B
  187. llms-from-scratch-cn-main/Translated_Book/img/fig-4-8.jpg 61.73KB
  188. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-8.jpg 212B
  189. llms-from-scratch-cn-main/Translated_Book/img/fig-A-7.jpg 47.44KB
  190. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-7.jpg 212B
  191. llms-from-scratch-cn-main/Translated_Book/img/fig-4-14.jpg 87.46KB
  192. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-14.jpg 212B
  193. llms-from-scratch-cn-main/Translated_Book/img/fig-3-21.jpg 43.3KB
  194. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-21.jpg 212B
  195. llms-from-scratch-cn-main/Translated_Book/img/fig-3-20.jpg 45.71KB
  196. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-20.jpg 212B
  197. llms-from-scratch-cn-main/Translated_Book/img/fig-4-15.jpg 168.83KB
  198. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-15.jpg 212B
  199. llms-from-scratch-cn-main/Translated_Book/img/fig-A-6.jpg 70.01KB
  200. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-6.jpg 212B
  201. llms-from-scratch-cn-main/Translated_Book/img/fig-A-4.jpg 74.29KB
  202. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-4.jpg 212B
  203. llms-from-scratch-cn-main/Translated_Book/img/fig-4-17.jpg 112.79KB
  204. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-17.jpg 212B
  205. llms-from-scratch-cn-main/Translated_Book/img/fig-3-22.jpg 172.46KB
  206. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-22.jpg 212B
  207. llms-from-scratch-cn-main/Translated_Book/img/fig-2-9.jpg 81.89KB
  208. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-9.jpg 212B
  209. llms-from-scratch-cn-main/Translated_Book/img/fig-2-8.jpg 109.97KB
  210. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-8.jpg 212B
  211. llms-from-scratch-cn-main/Translated_Book/img/fig-3-23.jpg 61.8KB
  212. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-23.jpg 212B
  213. llms-from-scratch-cn-main/Translated_Book/img/fig-4-16.jpg 155.7KB
  214. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-16.jpg 212B
  215. llms-from-scratch-cn-main/Translated_Book/img/fig-A-5.jpg 79.79KB
  216. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-5.jpg 212B
  217. llms-from-scratch-cn-main/Translated_Book/img/fig-2-10.jpg 150.51KB
  218. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-10.jpg 212B
  219. llms-from-scratch-cn-main/Translated_Book/img/fig-5-9.jpg 213.86KB
  220. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-9.jpg 212B
  221. llms-from-scratch-cn-main/Translated_Book/img/fig-5-8.jpg 95.4KB
  222. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-8.jpg 212B
  223. llms-from-scratch-cn-main/Translated_Book/img/fig-2-11.jpg 90.18KB
  224. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-11.jpg 212B
  225. llms-from-scratch-cn-main/Translated_Book/img/fig-2-13.jpg 107.2KB
  226. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-13.jpg 212B
  227. llms-from-scratch-cn-main/Translated_Book/img/fig-2-12.jpg 133.53KB
  228. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-12.jpg 212B
  229. llms-from-scratch-cn-main/Translated_Book/img/fig-3-9.jpg 66.19KB
  230. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-9.jpg 212B
  231. llms-from-scratch-cn-main/Translated_Book/img/fig-2-16.jpg 115.53KB
  232. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-16.jpg 212B
  233. llms-from-scratch-cn-main/Translated_Book/img/fig-2-17.jpg 125.99KB
  234. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-17.jpg 212B
  235. llms-from-scratch-cn-main/Translated_Book/img/fig-D-1.jpg 50.15KB
  236. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-D-1.jpg 212B
  237. llms-from-scratch-cn-main/Translated_Book/img/fig-3-8.jpg 79.49KB
  238. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-8.jpg 212B
  239. llms-from-scratch-cn-main/Translated_Book/img/.keep 1B
  240. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._.keep 212B
  241. llms-from-scratch-cn-main/Translated_Book/img/fig-2-15.jpg 86.82KB
  242. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-15.jpg 212B
  243. llms-from-scratch-cn-main/Translated_Book/img/fig-1-8.jpg 104.5KB
  244. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-8.jpg 212B
  245. llms-from-scratch-cn-main/Translated_Book/img/fig-1-9.jpg 97.65KB
  246. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-9.jpg 212B
  247. llms-from-scratch-cn-main/Translated_Book/img/fig-2-14.jpg 135.22KB
  248. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-14.jpg 212B
  249. llms-from-scratch-cn-main/Translated_Book/img/fig-D-2.jpg 59.13KB
  250. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-D-2.jpg 212B
  251. llms-from-scratch-cn-main/Translated_Book/img/fig-3-6.jpg 101.75KB
  252. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-6.jpg 212B
  253. llms-from-scratch-cn-main/Translated_Book/img/fig-2-19.jpg 147.47KB
  254. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-19.jpg 212B
  255. llms-from-scratch-cn-main/Translated_Book/img/fig-5-10.jpg 93.3KB
  256. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-10.jpg 212B
  257. llms-from-scratch-cn-main/Translated_Book/img/fig-1-4.jpg 126.46KB
  258. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-4.jpg 212B
  259. llms-from-scratch-cn-main/Translated_Book/img/fig-3-6.png 87.65KB
  260. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-6.png 212B
  261. llms-from-scratch-cn-main/Translated_Book/img/fig-5-1.jpg 102.06KB
  262. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-1.jpg 212B
  263. llms-from-scratch-cn-main/Translated_Book/img/fig-1-5.jpg 105.8KB
  264. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-5.jpg 212B
  265. llms-from-scratch-cn-main/Translated_Book/img/fig-2-18.jpg 69.81KB
  266. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-18.jpg 212B
  267. llms-from-scratch-cn-main/Translated_Book/img/fig-5-11.jpg 202.93KB
  268. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-11.jpg 212B
  269. llms-from-scratch-cn-main/Translated_Book/img/fig-5-11.png 208.53KB
  270. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-11.png 212B
  271. llms-from-scratch-cn-main/Translated_Book/img/fig-3-7.jpg 112.03KB
  272. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-7.jpg 212B
  273. llms-from-scratch-cn-main/Translated_Book/img/fig-3-5.jpg 83.27KB
  274. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-5.jpg 212B
  275. llms-from-scratch-cn-main/Translated_Book/img/fig-5-13.png 182.67KB
  276. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-13.png 212B
  277. llms-from-scratch-cn-main/Translated_Book/img/fig-5-13.jpg 91.33KB
  278. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-13.jpg 212B
  279. llms-from-scratch-cn-main/Translated_Book/img/fig-1-7.jpg 56.11KB
  280. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-7.jpg 212B
  281. llms-from-scratch-cn-main/Translated_Book/img/fig-3-5.png 152.63KB
  282. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-5.png 212B
  283. llms-from-scratch-cn-main/Translated_Book/img/fig-5-3.png 117.99KB
  284. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-3.png 212B
  285. llms-from-scratch-cn-main/Translated_Book/img/fig-5-2.jpg 102KB
  286. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-2.jpg 212B
  287. llms-from-scratch-cn-main/Translated_Book/img/fig-3-4.png 134.22KB
  288. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-4.png 212B
  289. llms-from-scratch-cn-main/Translated_Book/img/fig-5-12.jpg 70.08KB
  290. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-12.jpg 212B
  291. llms-from-scratch-cn-main/Translated_Book/img/fig-1-6.png 132.5KB
  292. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-6.png 212B
  293. llms-from-scratch-cn-main/Translated_Book/img/fig-5-12.png 69.83KB
  294. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-12.png 212B
  295. llms-from-scratch-cn-main/Translated_Book/img/fig-3-4.jpg 70.08KB
  296. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-4.jpg 212B
  297. llms-from-scratch-cn-main/Translated_Book/img/fig-1-2.jpg 101.12KB
  298. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-2.jpg 212B
  299. llms-from-scratch-cn-main/Translated_Book/img/fig-5-16.jpg 94.51KB
  300. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-16.jpg 212B
  301. llms-from-scratch-cn-main/Translated_Book/img/fig-5-6.jpg 93.44KB
  302. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-6.jpg 212B
  303. llms-from-scratch-cn-main/Translated_Book/img/fig-5-7.jpg 95.7KB
  304. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-7.jpg 212B
  305. llms-from-scratch-cn-main/Translated_Book/img/cover-1.jpg 83.35KB
  306. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._cover-1.jpg 212B
  307. llms-from-scratch-cn-main/Translated_Book/img/fig-3-1.png 225.12KB
  308. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-1.png 212B
  309. llms-from-scratch-cn-main/Translated_Book/img/fig-5-17.jpg 137.73KB
  310. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-17.jpg 212B
  311. llms-from-scratch-cn-main/Translated_Book/img/fig-1-3.jpg 104.25KB
  312. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-3.jpg 212B
  313. llms-from-scratch-cn-main/Translated_Book/img/fig-3-1.jpg 76.02KB
  314. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-1.jpg 212B
  315. llms-from-scratch-cn-main/Translated_Book/img/fig-3-3.jpg 113.68KB
  316. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-3.jpg 212B
  317. llms-from-scratch-cn-main/Translated_Book/img/fig-2-20.png 86.1KB
  318. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-20.png 212B
  319. llms-from-scratch-cn-main/Translated_Book/img/fig-1-1.jpg 68.49KB
  320. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1-1.jpg 212B
  321. llms-from-scratch-cn-main/Translated_Book/img/fig-5-15.jpg 108.58KB
  322. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-15.jpg 212B
  323. llms-from-scratch-cn-main/Translated_Book/img/fig-3-3.png 202.77KB
  324. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-3.png 212B
  325. llms-from-scratch-cn-main/Translated_Book/img/fig-5-5.png 157.49KB
  326. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-5.png 212B
  327. llms-from-scratch-cn-main/Translated_Book/img/fig-5-4.jpg 111.83KB
  328. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-4.jpg 212B
  329. llms-from-scratch-cn-main/Translated_Book/img/cover-2.jpg 83.55KB
  330. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._cover-2.jpg 212B
  331. llms-from-scratch-cn-main/Translated_Book/img/fig-3-2.png 188KB
  332. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-2.png 212B
  333. llms-from-scratch-cn-main/Translated_Book/img/fig-5-14.jpg 72.42KB
  334. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-5-14.jpg 212B
  335. llms-from-scratch-cn-main/Translated_Book/img/fig-2-21.png 138.58KB
  336. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-21.png 212B
  337. llms-from-scratch-cn-main/Translated_Book/img/fig-3-2.jpg 81.32KB
  338. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-2.jpg 212B
  339. llms-from-scratch-cn-main/Translated_Book/img/fig-4-3.jpg 114.82KB
  340. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-3.jpg 212B
  341. llms-from-scratch-cn-main/Translated_Book/img/Figure 1.2.png 67.07KB
  342. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._Figure 1.2.png 212B
  343. llms-from-scratch-cn-main/Translated_Book/img/fig-A-10.jpg 80.32KB
  344. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-10.jpg 212B
  345. llms-from-scratch-cn-main/Translated_Book/img/fig-4-3.png 209.49KB
  346. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-3.png 212B
  347. llms-from-scratch-cn-main/Translated_Book/img/fig-A-8.jpg 118.12KB
  348. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-8.jpg 212B
  349. llms-from-scratch-cn-main/Translated_Book/img/fig-3-12.jpg 85.3KB
  350. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-12.jpg 212B
  351. llms-from-scratch-cn-main/Translated_Book/img/fig-2-5.jpg 42.94KB
  352. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-5.jpg 212B
  353. llms-from-scratch-cn-main/Translated_Book/img/fig-2-4.jpg 87.74KB
  354. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-4.jpg 212B
  355. llms-from-scratch-cn-main/Translated_Book/img/fig-3-13.jpg 109.86KB
  356. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-13.jpg 212B
  357. llms-from-scratch-cn-main/Translated_Book/img/fig-A-9.jpg 197.05KB
  358. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-9.jpg 212B
  359. llms-from-scratch-cn-main/Translated_Book/img/fig-4-2.png 121.98KB
  360. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-2.png 212B
  361. llms-from-scratch-cn-main/Translated_Book/img/fig-A-11.jpg 96.81KB
  362. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-11.jpg 212B
  363. llms-from-scratch-cn-main/Translated_Book/img/Figure 1.3.png 87.5KB
  364. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._Figure 1.3.png 212B
  365. llms-from-scratch-cn-main/Translated_Book/img/fig-4-2.jpg 123.38KB
  366. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-2.jpg 212B
  367. llms-from-scratch-cn-main/Translated_Book/img/Figure 1.1.png 54.69KB
  368. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._Figure 1.1.png 212B
  369. llms-from-scratch-cn-main/Translated_Book/img/fig-A-13.jpg 65.57KB
  370. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-13.jpg 212B
  371. llms-from-scratch-cn-main/Translated_Book/img/fig-4-18.jpg 101.23KB
  372. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-18.jpg 212B
  373. llms-from-scratch-cn-main/Translated_Book/img/fig-3-11.jpg 112.16KB
  374. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-11.jpg 212B
  375. llms-from-scratch-cn-main/Translated_Book/img/fig-2-6.jpg 134.34KB
  376. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-6.jpg 212B
  377. llms-from-scratch-cn-main/Translated_Book/img/fig-1.7-1.jpg 939.45KB
  378. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-1.7-1.jpg 212B
  379. llms-from-scratch-cn-main/Translated_Book/img/fig-2-7.jpg 120.84KB
  380. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-7.jpg 212B
  381. llms-from-scratch-cn-main/Translated_Book/img/fig-3-10.jpg 87.2KB
  382. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-10.jpg 212B
  383. llms-from-scratch-cn-main/Translated_Book/img/fig-4-1.png 188.84KB
  384. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-1.png 212B
  385. llms-from-scratch-cn-main/Translated_Book/img/fig-A-12.jpg 71.63KB
  386. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-A-12.jpg 212B
  387. llms-from-scratch-cn-main/Translated_Book/img/fig-4-1.jpg 89.94KB
  388. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-1.jpg 212B
  389. llms-from-scratch-cn-main/Translated_Book/img/fig-4-5.jpg 153.75KB
  390. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-5.jpg 212B
  391. llms-from-scratch-cn-main/Translated_Book/img/Figure 1.4.png 129.01KB
  392. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._Figure 1.4.png 212B
  393. llms-from-scratch-cn-main/Translated_Book/img/fig-4-5.png 274.39KB
  394. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-5.png 212B
  395. llms-from-scratch-cn-main/Translated_Book/img/fig-2-3.jpg 78.54KB
  396. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-3.jpg 212B
  397. llms-from-scratch-cn-main/Translated_Book/img/fig-3-14.jpg 82.49KB
  398. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-14.jpg 212B
  399. llms-from-scratch-cn-main/Translated_Book/img/fig-3-15.jpg 72.77KB
  400. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-15.jpg 212B
  401. llms-from-scratch-cn-main/Translated_Book/img/fig-2-2.jpg 85.79KB
  402. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-2.jpg 212B
  403. llms-from-scratch-cn-main/Translated_Book/img/fig-4-4.png 218.43KB
  404. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-4.png 212B
  405. llms-from-scratch-cn-main/Translated_Book/img/Figure 1.5.png 109.51KB
  406. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._Figure 1.5.png 212B
  407. llms-from-scratch-cn-main/Translated_Book/img/fig-4-4.jpg 151KB
  408. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-4.jpg 212B
  409. llms-from-scratch-cn-main/Translated_Book/img/fig-4-6.jpg 148.58KB
  410. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-6.jpg 212B
  411. llms-from-scratch-cn-main/Translated_Book/img/fig-4-6.png 217.09KB
  412. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-6.png 212B
  413. llms-from-scratch-cn-main/Translated_Book/img/fig-3-17.jpg 88.76KB
  414. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-17.jpg 212B
  415. llms-from-scratch-cn-main/Translated_Book/img/fig-3-16.jpg 85.69KB
  416. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-3-16.jpg 212B
  417. llms-from-scratch-cn-main/Translated_Book/img/fig-2-1.jpg 106.27KB
  418. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-2-1.jpg 212B
  419. llms-from-scratch-cn-main/Translated_Book/img/fig-4-7.png 216.16KB
  420. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-7.png 212B
  421. llms-from-scratch-cn-main/Translated_Book/img/Figure 1.6.png 76.27KB
  422. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._Figure 1.6.png 212B
  423. llms-from-scratch-cn-main/Translated_Book/img/fig-4-7.jpg 92.46KB
  424. __MACOSX/llms-from-scratch-cn-main/Translated_Book/img/._fig-4-7.jpg 212B
  425. llms-from-scratch-cn-main/Translated_Book/ch02/2.1理解词嵌入.ipynb 6.61KB
  426. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.1理解词嵌入.ipynb 212B
  427. llms-from-scratch-cn-main/Translated_Book/ch02/2.5 字节对编码(BPE).ipynb 101.49KB
  428. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.5 字节对编码(BPE).ipynb 212B
  429. llms-from-scratch-cn-main/Translated_Book/ch02/2.8词位置编码.ipynb 8.11KB
  430. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.8词位置编码.ipynb 212B
  431. llms-from-scratch-cn-main/Translated_Book/ch02/2.6使用滑动窗口进行数据采样.ipynb 20.38KB
  432. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.6使用滑动窗口进行数据采样.ipynb 212B
  433. llms-from-scratch-cn-main/Translated_Book/ch02/.keep
  434. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._.keep 212B
  435. llms-from-scratch-cn-main/Translated_Book/ch02/2.7 构建词符嵌入.ipynb 6.24KB
  436. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.7 构建词符嵌入.ipynb 212B
  437. llms-from-scratch-cn-main/Translated_Book/ch02/2.文本数据处理.ipynb 3.25KB
  438. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.文本数据处理.ipynb 212B
  439. llms-from-scratch-cn-main/Translated_Book/ch02/2.2文本分词(序列化).ipynb 13.23KB
  440. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.2文本分词(序列化).ipynb 212B
  441. llms-from-scratch-cn-main/Translated_Book/ch02/2.3将令牌转换为令牌 ID.ipynb 16.38KB
  442. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.3将令牌转换为令牌 ID.ipynb 212B
  443. llms-from-scratch-cn-main/Translated_Book/ch02/2.4添加特殊上下文tokens.ipynb 13.69KB
  444. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch02/._2.4添加特殊上下文tokens.ipynb 212B
  445. llms-from-scratch-cn-main/Translated_Book/ch05/5.1 在未标记的数据上进行预训练.ipynb 63.19KB
  446. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch05/._5.1 在未标记的数据上进行预训练.ipynb 212B
  447. llms-from-scratch-cn-main/Translated_Book/ch05/.keep
  448. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch05/._.keep 212B
  449. llms-from-scratch-cn-main/Translated_Book/ch05/5.3.ipynb 23.63KB
  450. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch05/._5.3.ipynb 212B
  451. llms-from-scratch-cn-main/Translated_Book/ch05/5.2.ipynb 14.26KB
  452. __MACOSX/llms-from-scratch-cn-main/Translated_Book/ch05/._5.2.ipynb 212B
  453. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/llama3-from-scratch.ipynb 289.18KB
  454. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._llama3-from-scratch.ipynb 212B
  455. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/LICENSE 1.05KB
  456. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._LICENSE 212B
  457. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/requirements.txt 48B
  458. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._requirements.txt 212B
  459. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/
  460. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._images 212B
  461. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/params.txt 182B
  462. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._params.txt 212B
  463. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/params.json 212B
  464. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._params.json 212B
  465. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/tokenizer.model 2.08MB
  466. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._tokenizer.model 212B
  467. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/README.md 44.81KB
  468. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/._README.md 212B
  469. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/modeling_phi3.py 70.42KB
  470. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/._modeling_phi3.py 212B
  471. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/phi-3.ipynb 8.7KB
  472. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/._phi-3.ipynb 212B
  473. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/configuration_phi3.py 9.25KB
  474. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/phi-3/._configuration_phi3.py 212B
  475. llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/configuration_olmo.py 7.81KB
  476. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/._configuration_olmo.py 212B
  477. llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/olmo.ipynb 6.33KB
  478. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/._olmo.ipynb 212B
  479. llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/modeling_olmo.py 57.71KB
  480. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/olmo/._modeling_olmo.py 212B
  481. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/configuration_minicpm.py 2.4KB
  482. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._configuration_minicpm.py 212B
  483. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/tokenizer_config.json 1.11KB
  484. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._tokenizer_config.json 212B
  485. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/special_tokens_map.json 414B
  486. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._special_tokens_map.json 212B
  487. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/MiniCPM.ipynb 56.76KB
  488. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._MiniCPM.ipynb 212B
  489. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/gitattributes 1.52KB
  490. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._gitattributes 212B
  491. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/config.json 712B
  492. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._config.json 212B
  493. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/tokenizer.json 5.92MB
  494. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._tokenizer.json 212B
  495. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/MiniCPM.py 31.54KB
  496. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._MiniCPM.py 212B
  497. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/generation_config.json 113B
  498. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._generation_config.json 212B
  499. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/tokenizer.model 1.9MB
  500. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._tokenizer.model 212B
  501. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/README.md 11.31KB
  502. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._README.md 212B
  503. llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/MiniCPMTest.ipynb 9.94KB
  504. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/MiniCPM/._MiniCPMTest.ipynb 212B
  505. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v1/model.py 21.5KB
  506. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v1/._model.py 212B
  507. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v1/readme.md 8.83KB
  508. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v1/._readme.md 212B
  509. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/RWKV_v6_demo.ipynb 15.14KB
  510. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/._RWKV_v6_demo.ipynb 212B
  511. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/rwkv_vocab_v20230424.txt 1.04MB
  512. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/._rwkv_vocab_v20230424.txt 212B
  513. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/img/
  514. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/._img 212B
  515. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/RWKV-v6-guide.ipynb 21.63KB
  516. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/._RWKV-v6-guide.ipynb 212B
  517. llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/tokenization_gptpangu_bak.py 4.58KB
  518. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/._tokenization_gptpangu_bak.py 212B
  519. llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/modeling_gptpangu.py 21.67KB
  520. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/._modeling_gptpangu.py 212B
  521. llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/pangu.ipynb 12.99KB
  522. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/._pangu.ipynb 212B
  523. llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/tokenization_gptpangu.py 4.15KB
  524. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/._tokenization_gptpangu.py 212B
  525. llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/configuration_gptpangu.py 1.83KB
  526. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/pangu/._configuration_gptpangu.py 212B
  527. llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/demo.ipynb 10.42KB
  528. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/._demo.ipynb 212B
  529. llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/model.py 12.17KB
  530. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/._model.py 212B
  531. llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/README.md 1.32KB
  532. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/mamba/._README.md 212B
  533. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/model_v5.py 9.98KB
  534. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._model_v5.py 212B
  535. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/model_v1.py 21.5KB
  536. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._model_v1.py 212B
  537. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/model_v4.py 6.42KB
  538. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._model_v4.py 212B
  539. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/readme.md 11.02KB
  540. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._readme.md 212B
  541. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/model_v3.py 8.92KB
  542. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._model_v3.py 212B
  543. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/model_v6.py 9.43KB
  544. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._model_v6.py 212B
  545. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/model_v2.py 8.03KB
  546. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-compare/._model_v2.py 212B
  547. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/chatglm4.ipynb 11.67KB
  548. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/._chatglm4.ipynb 212B
  549. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/configuration_chatglm.py 2.21KB
  550. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/._configuration_chatglm.py 212B
  551. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/tokenization_chatglm.py 15.28KB
  552. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/._tokenization_chatglm.py 212B
  553. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/chatglm4-guide.ipynb 188.91KB
  554. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/._chatglm4-guide.ipynb 212B
  555. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/modeling_chatglm.py 51.74KB
  556. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM4/._modeling_chatglm.py 212B
  557. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/glm.py 46.9KB
  558. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._glm.py 212B
  559. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/tokenizer_config.json 1.38KB
  560. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._tokenizer_config.json 212B
  561. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/quantization.py 14.32KB
  562. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._quantization.py 212B
  563. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/tokenization_chatglm.py 12.69KB
  564. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._tokenization_chatglm.py 212B
  565. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/configuration_chatglm_full.py 1.07KB
  566. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._configuration_chatglm_full.py 212B
  567. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/tokenizer.model 994.5KB
  568. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._tokenizer.model 212B
  569. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/README.md 1.43KB
  570. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._README.md 212B
  571. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/img/
  572. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._img 212B
  573. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/加载模型权重.ipynb 79.31KB
  574. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/._加载模型权重.ipynb 212B
  575. llms-from-scratch-cn-main/Model_Architecture_Discussions/img/.keep 1B
  576. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/img/._.keep 212B
  577. llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/openelm.ipynb 11.18KB
  578. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/._openelm.ipynb 212B
  579. llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/configuration_openelm.py 13.83KB
  580. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/._configuration_openelm.py 212B
  581. llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/modeling_openelm.py 38.32KB
  582. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/openelm/._modeling_openelm.py 212B
  583. llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/gptj.ipynb 8.33KB
  584. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/._gptj.ipynb 212B
  585. llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/modeling_gptj.py 60.99KB
  586. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/._modeling_gptj.py 212B
  587. llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/configuration_gptj.py 7.99KB
  588. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/gptj/._configuration_gptj.py 212B
  589. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/model_run.py 11.06KB
  590. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/._model_run.py 212B
  591. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/20B_tokenizer.json 2.35MB
  592. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/._20B_tokenizer.json 212B
  593. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/model.py 8.97KB
  594. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/._model.py 212B
  595. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/rwkv-v3-guide.ipynb 27.67KB
  596. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/._rwkv-v3-guide.ipynb 212B
  597. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/rwkv-v3.ipynb 10.57KB
  598. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/._rwkv-v3.ipynb 212B
  599. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/utils.py 3.98KB
  600. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v3/._utils.py 212B
  601. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v4/20B_tokenizer.json 2.35MB
  602. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v4/._20B_tokenizer.json 212B
  603. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v4/rwkv-v4-guide.ipynb 21.21KB
  604. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v4/._rwkv-v4-guide.ipynb 212B
  605. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/rwkv_vocab_v20230424.txt 1.04MB
  606. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/._rwkv_vocab_v20230424.txt 212B
  607. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/img/
  608. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/._img 212B
  609. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/RWKV_v5_demo.ipynb 21.4KB
  610. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/._RWKV_v5_demo.ipynb 212B
  611. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/RWKV-v5-guide.ipynb 42.41KB
  612. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/._RWKV-v5-guide.ipynb 212B
  613. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/20B_tokenizer.json 2.35MB
  614. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/._20B_tokenizer.json 212B
  615. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/rwkv-v2-guide.ipynb 33.25KB
  616. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/._rwkv-v2-guide.ipynb 212B
  617. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/model.py 8.03KB
  618. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/._model.py 212B
  619. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/img/
  620. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/._img 212B
  621. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/rwkv-v2.ipynb 35.39KB
  622. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/._rwkv-v2.ipynb 212B
  623. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/modeling_phi.py 66.49KB
  624. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/._modeling_phi.py 212B
  625. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/configuration_phi.py 8.26KB
  626. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/._configuration_phi.py 212B
  627. llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/phi.ipynb 13.82KB
  628. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/phi/._phi.ipynb 212B
  629. llms-from-scratch-cn-main/Book/ch06/.keep
  630. __MACOSX/llms-from-scratch-cn-main/Book/ch06/._.keep 212B
  631. llms-from-scratch-cn-main/Book/ch01/.keep
  632. __MACOSX/llms-from-scratch-cn-main/Book/ch01/._.keep 212B
  633. llms-from-scratch-cn-main/Book/ch04/.keep
  634. __MACOSX/llms-from-scratch-cn-main/Book/ch04/._.keep 212B
  635. llms-from-scratch-cn-main/Book/ch03/.keep
  636. __MACOSX/llms-from-scratch-cn-main/Book/ch03/._.keep 212B
  637. llms-from-scratch-cn-main/Book/ch02/.keep
  638. __MACOSX/llms-from-scratch-cn-main/Book/ch02/._.keep 212B
  639. llms-from-scratch-cn-main/Book/ch05/.keep
  640. __MACOSX/llms-from-scratch-cn-main/Book/ch05/._.keep 212B
  641. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/
  642. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/._01_main-chapter-code 212B
  643. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/
  644. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/._03_model-evaluation 212B
  645. llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/
  646. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/._05_dataset-generation 212B
  647. llms-from-scratch-cn-main/Codes/ch07/README.md 740B
  648. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/._README.md 212B
  649. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/
  650. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/._02_dataset-utilities 212B
  651. llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/
  652. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/._04_preference-tuning-with-dpo 212B
  653. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/
  654. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/._01_main-chapter-code 212B
  655. llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/
  656. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/._02_bonus_additional-experiments 212B
  657. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/
  658. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/._03_bonus_imdb-classification 212B
  659. llms-from-scratch-cn-main/Codes/ch01/README.md 84B
  660. __MACOSX/llms-from-scratch-cn-main/Codes/ch01/._README.md 212B
  661. llms-from-scratch-cn-main/Codes/appendix-B/README.md 829B
  662. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-B/._README.md 212B
  663. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/
  664. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/._01_main-chapter-code 212B
  665. llms-from-scratch-cn-main/Codes/ch04/README.md 147B
  666. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/._README.md 212B
  667. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/
  668. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/._01_main-chapter-code 212B
  669. llms-from-scratch-cn-main/Codes/ch03/README.md 120B
  670. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/._README.md 212B
  671. llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/
  672. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/._02_bonus_bytepair-encoder 212B
  673. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/
  674. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/._03_bonus_embedding-vs-matmul 212B
  675. llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/
  676. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/._01_main-chapter-code 212B
  677. llms-from-scratch-cn-main/Codes/ch02/README.md 500B
  678. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/._README.md 212B
  679. llms-from-scratch-cn-main/Codes/ch02/09_summary/
  680. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/._09_summary 212B
  681. llms-from-scratch-cn-main/Codes/ch05/04_learning_rate_schedulers/
  682. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/._04_learning_rate_schedulers 212B
  683. llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/
  684. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/._03_bonus_pretraining_on_gutenberg 212B
  685. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/
  686. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/._01_main-chapter-code 212B
  687. llms-from-scratch-cn-main/Codes/ch05/README.md 600B
  688. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/._README.md 212B
  689. llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/
  690. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/._05_bonus_hparam_tuning 212B
  691. llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/
  692. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/._02_alternative_weight_loading 212B
  693. llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/
  694. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/._03_main-chapter-code 212B
  695. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/
  696. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/._01_optional-python-setup-preferences 212B
  697. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/
  698. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/._02_installing-python-libraries 212B
  699. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/archi.png 845.81KB
  700. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._archi.png 212B
  701. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/ropesplit.png 401.41KB
  702. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._ropesplit.png 212B
  703. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/tokens.png 488.49KB
  704. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._tokens.png 212B
  705. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/keys.png 430.16KB
  706. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._keys.png 212B
  707. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/embeddings.png 470.5KB
  708. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._embeddings.png 212B
  709. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/attention.png 202.27KB
  710. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._attention.png 212B
  711. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_39_0.png 26.96KB
  712. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_39_0.png 212B
  713. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_41_0.png 25.82KB
  714. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_41_0.png 212B
  715. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/heads.png 799.73KB
  716. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._heads.png 212B
  717. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/last_norm.png 1003.83KB
  718. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._last_norm.png 212B
  719. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/god.png 1.21MB
  720. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._god.png 212B
  721. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/qkv.png 497.17KB
  722. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._qkv.png 212B
  723. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/swiglu.png 604.83KB
  724. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._swiglu.png 212B
  725. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/freq_cis.png 813.92KB
  726. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._freq_cis.png 212B
  727. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/model.png 658.84KB
  728. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._model.png 212B
  729. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_42_0.png 27.37KB
  730. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_42_0.png 212B
  731. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/rms.png 340.74KB
  732. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._rms.png 212B
  733. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/softmax.png 190.99KB
  734. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._softmax.png 212B
  735. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/value.png 199.91KB
  736. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._value.png 212B
  737. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/weightmatrix.png 379.86KB
  738. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._weightmatrix.png 212B
  739. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/qsplit.png 551.01KB
  740. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._qsplit.png 212B
  741. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/keys0.png 422.6KB
  742. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._keys0.png 212B
  743. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/q_per_token.png 483.94KB
  744. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._q_per_token.png 212B
  745. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_30_0.png 48.6KB
  746. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_30_0.png 212B
  747. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/finallayer.png 799.14KB
  748. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._finallayer.png 212B
  749. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/norm.png 308.67KB
  750. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._norm.png 212B
  751. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/stacked.png 383.59KB
  752. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._stacked.png 212B
  753. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/v0.png 188.19KB
  754. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._v0.png 212B
  755. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/a10.png 633.97KB
  756. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._a10.png 212B
  757. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/42.png 772.73KB
  758. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._42.png 212B
  759. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_54_0.png 27.37KB
  760. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_54_0.png 212B
  761. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/afterattention.png 289.26KB
  762. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._afterattention.png 212B
  763. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/rope.png 516.22KB
  764. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._rope.png 212B
  765. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/norm_after.png 297.39KB
  766. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._norm_after.png 212B
  767. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/mask.png 471.46KB
  768. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._mask.png 212B
  769. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_52_0.png 25.81KB
  770. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_52_0.png 212B
  771. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/implllama3_50_0.png 26.94KB
  772. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._implllama3_50_0.png 212B
  773. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/karpathyminbpe.png 787.45KB
  774. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._karpathyminbpe.png 212B
  775. llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/qkmatmul.png 189.33KB
  776. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/llama3/images/._qkmatmul.png 212B
  777. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/img/01.png 100.34KB
  778. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v6/img/._01.png 212B
  779. llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/img/img.png 111.38KB
  780. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/ChatGLM3/img/._img.png 212B
  781. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/img/01.png 100.34KB
  782. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v5/img/._01.png 212B
  783. llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/img/01.png 231.82KB
  784. __MACOSX/llms-from-scratch-cn-main/Model_Architecture_Discussions/rwkv-v2/img/._01.png 212B
  785. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/exercise-solutions.ipynb 36.83KB
  786. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._exercise-solutions.ipynb 212B
  787. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/ch07.ipynb 125.91KB
  788. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._ch07.ipynb 212B
  789. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/previous_chapters.py 17.6KB
  790. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._previous_chapters.py 212B
  791. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/instruction-data.json 198.75KB
  792. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._instruction-data.json 212B
  793. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/exercise_experiments.py 18.98KB
  794. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._exercise_experiments.py 212B
  795. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/ollama_evaluate.py 3.88KB
  796. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._ollama_evaluate.py 212B
  797. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/load-finetuned-model.ipynb 6.01KB
  798. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._load-finetuned-model.ipynb 212B
  799. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/README.md 3.36KB
  800. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._README.md 212B
  801. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/gpt_download.py 5.61KB
  802. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._gpt_download.py 212B
  803. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/instruction-data-with-response.json 28.59KB
  804. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._instruction-data-with-response.json 212B
  805. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/tests.py 597B
  806. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._tests.py 212B
  807. llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/gpt_instruction_finetuning.py 11.14KB
  808. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/01_main-chapter-code/._gpt_instruction_finetuning.py 212B
  809. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/config.json 115B
  810. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._config.json 212B
  811. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/eval-example-data.json 36.01KB
  812. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._eval-example-data.json 212B
  813. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/
  814. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._scores 212B
  815. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/README.md 1.02KB
  816. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._README.md 212B
  817. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/llm-instruction-eval-openai.ipynb 20.12KB
  818. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._llm-instruction-eval-openai.ipynb 212B
  819. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/llm-instruction-eval-ollama.ipynb 23.12KB
  820. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._llm-instruction-eval-ollama.ipynb 212B
  821. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/requirements-extra.txt 28B
  822. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/._requirements-extra.txt 212B
  823. llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/llama3-ollama.ipynb 29.48KB
  824. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/._llama3-ollama.ipynb 212B
  825. llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/README.md 295B
  826. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/._README.md 212B
  827. llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/instruction-data-llama3-7b.json 10KB
  828. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/05_dataset-generation/._instruction-data-llama3-7b.json 212B
  829. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/config.json 115B
  830. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._config.json 212B
  831. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/instruction-examples-modified.json 53.55KB
  832. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._instruction-examples-modified.json 212B
  833. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/README.md 2.16KB
  834. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._README.md 212B
  835. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/create-passive-voice-entries.ipynb 11.94KB
  836. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._create-passive-voice-entries.ipynb 212B
  837. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/instruction-examples.json 38.43KB
  838. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._instruction-examples.json 212B
  839. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/requirements-extra.txt 47B
  840. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._requirements-extra.txt 212B
  841. llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/find-near-duplicates.py 5.08KB
  842. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/02_dataset-utilities/._find-near-duplicates.py 212B
  843. llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb 179.94KB
  844. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/._dpo-from-scratch.ipynb 212B
  845. llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/instruction-data-with-preference.json 377.9KB
  846. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/._instruction-data-with-preference.json 212B
  847. llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/previous_chapters.py 17.62KB
  848. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/._previous_chapters.py 212B
  849. llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/README.md 366B
  850. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/._README.md 212B
  851. llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/create-preference-data-ollama.ipynb 21.23KB
  852. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/04_preference-tuning-with-dpo/._create-preference-data-ollama.ipynb 212B
  853. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/exercise-solutions.ipynb 5.1KB
  854. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._exercise-solutions.ipynb 212B
  855. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/previous_chapters.py 11.75KB
  856. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._previous_chapters.py 212B
  857. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/ch06.ipynb 137.77KB
  858. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._ch06.ipynb 212B
  859. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/README.md 700B
  860. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._README.md 212B
  861. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/gpt-class-finetune.py 15.34KB
  862. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._gpt-class-finetune.py 212B
  863. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/gpt_download.py 3.76KB
  864. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._gpt_download.py 212B
  865. llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/tests.py 597B
  866. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/01_main-chapter-code/._tests.py 212B
  867. llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/previous_chapters.py 13.21KB
  868. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/._previous_chapters.py 212B
  869. llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/additional-experiments.py 20.45KB
  870. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/._additional-experiments.py 212B
  871. llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/README.md 8.64KB
  872. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/._README.md 212B
  873. llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/gpt_download.py 3.76KB
  874. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/02_bonus_additional-experiments/._gpt_download.py 212B
  875. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/train-sklearn-logreg.py 2.83KB
  876. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._train-sklearn-logreg.py 212B
  877. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/previous_chapters.py 11.75KB
  878. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._previous_chapters.py 212B
  879. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/sklearn-baseline.ipynb 7.88KB
  880. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._sklearn-baseline.ipynb 212B
  881. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/download-prepare-dataset.py 3.07KB
  882. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._download-prepare-dataset.py 212B
  883. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/README.md 3.44KB
  884. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._README.md 212B
  885. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/gpt_download.py 3.76KB
  886. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._gpt_download.py 212B
  887. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/train-bert-hf.py 10.71KB
  888. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._train-bert-hf.py 212B
  889. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/train-gpt.py 12.97KB
  890. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._train-gpt.py 212B
  891. llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/requirements-extra.txt 40B
  892. __MACOSX/llms-from-scratch-cn-main/Codes/ch06/03_bonus_imdb-classification/._requirements-extra.txt 212B
  893. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/exercise-solutions.ipynb 11.57KB
  894. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/._exercise-solutions.ipynb 212B
  895. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/previous_chapters.py 3.86KB
  896. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/._previous_chapters.py 212B
  897. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/ch04.ipynb 82.48KB
  898. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/._ch04.ipynb 212B
  899. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/README.md 502B
  900. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/._README.md 212B
  901. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/
  902. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/._figures 212B
  903. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/gpt.py 9.39KB
  904. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/._gpt.py 212B
  905. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/ch03.ipynb 71.89KB
  906. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/._ch03.ipynb 212B
  907. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/exercise-solutions.ipynb 7.86KB
  908. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/._exercise-solutions.ipynb 212B
  909. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/small-text-sample.txt 1.92KB
  910. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/._small-text-sample.txt 212B
  911. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/README.md 264B
  912. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/._README.md 212B
  913. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/
  914. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/._figures 212B
  915. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/multihead-attention.ipynb 15.62KB
  916. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/._multihead-attention.ipynb 212B
  917. llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/gpt2_model/
  918. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/._gpt2_model 212B
  919. llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/bpe_openai_gpt2.py 7.81KB
  920. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/._bpe_openai_gpt2.py 212B
  921. llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/README.md 233B
  922. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/._README.md 212B
  923. llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/compare-bpe-tiktoken.ipynb 10.8KB
  924. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/._compare-bpe-tiktoken.ipynb 212B
  925. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/
  926. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/._images 212B
  927. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/README.md 218B
  928. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/._README.md 212B
  929. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/embeddings-and-linear-layers.ipynb 12.33KB
  930. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/._embeddings-and-linear-layers.ipynb 212B
  931. llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/dataloader.ipynb 4.67KB
  932. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/._dataloader.ipynb 212B
  933. llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/exercise-solutions.ipynb 7.27KB
  934. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/._exercise-solutions.ipynb 212B
  935. llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/ch02.ipynb 45.42KB
  936. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/._ch02.ipynb 212B
  937. llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/README.md 221B
  938. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/._README.md 212B
  939. llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/the-verdict.txt 20KB
  940. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/01_main-chapter-code/._the-verdict.txt 212B
  941. llms-from-scratch-cn-main/Codes/ch02/09_summary/09_summary.ipynb 2.07KB
  942. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/09_summary/._09_summary.ipynb 212B
  943. llms-from-scratch-cn-main/Codes/ch05/04_learning_rate_schedulers/README.md 506B
  944. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/04_learning_rate_schedulers/._README.md 212B
  945. llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/prepare_dataset.py 2.82KB
  946. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/._prepare_dataset.py 212B
  947. llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/previous_chapters.py 11.02KB
  948. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/._previous_chapters.py 212B
  949. llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/README.md 6.14KB
  950. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/._README.md 212B
  951. llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/pretraining_simple.py 8.29KB
  952. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/03_bonus_pretraining_on_gutenberg/._pretraining_simple.py 212B
  953. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/ch05.ipynb 143.95KB
  954. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._ch05.ipynb 212B
  955. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/
  956. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._images 212B
  957. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/previous_chapters.py 9.35KB
  958. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._previous_chapters.py 212B
  959. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/README.md 578B
  960. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._README.md 212B
  961. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/gpt_train.py 7.91KB
  962. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._gpt_train.py 212B
  963. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/gpt_download.py 3.49KB
  964. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._gpt_download.py 212B
  965. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/gpt_generate.py 9.68KB
  966. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._gpt_generate.py 212B
  967. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/tests.py 1.24KB
  968. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/._tests.py 212B
  969. llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/hparam_search.py 7.46KB
  970. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/._hparam_search.py 212B
  971. llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/previous_chapters.py 9.62KB
  972. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/._previous_chapters.py 212B
  973. llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/README.md 745B
  974. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/._README.md 212B
  975. llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/the-verdict.txt 20KB
  976. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/05_bonus_hparam_tuning/._the-verdict.txt 212B
  977. llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/weight-loading-hf-transformers.ipynb 11.17KB
  978. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/._weight-loading-hf-transformers.ipynb 212B
  979. llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/previous_chapters.py 9.88KB
  980. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/._previous_chapters.py 212B
  981. llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/README.md 319B
  982. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/02_alternative_weight_loading/._README.md 212B
  983. llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/code-part2.ipynb 11.36KB
  984. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/._code-part2.ipynb 212B
  985. llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/exercise-solutions.ipynb 3.71KB
  986. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/._exercise-solutions.ipynb 212B
  987. llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/code-part1.ipynb 30.47KB
  988. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/._code-part1.ipynb 212B
  989. llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/DDP-script.py 5.09KB
  990. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/03_main-chapter-code/._DDP-script.py 212B
  991. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/README.md 3.48KB
  992. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/._README.md 212B
  993. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/
  994. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/._figures 212B
  995. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/requirements.txt 137B
  996. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/._requirements.txt 212B
  997. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/README.md 2.11KB
  998. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/._README.md 212B
  999. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/
  1000. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/._figures 212B
  1001. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/python_environment_check.ipynb 1.29KB
  1002. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/._python_environment_check.ipynb 212B
  1003. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/python_environment_check.py 2.22KB
  1004. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/._python_environment_check.py 212B
  1005. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/llama3-8b-model-2-response.json 393B
  1006. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/._llama3-8b-model-2-response.json 212B
  1007. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/llama3-8b-model-1-response.json 402B
  1008. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/._llama3-8b-model-1-response.json 212B
  1009. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/gpt4-model-1-response.json 445B
  1010. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/._gpt4-model-1-response.json 212B
  1011. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/gpt4-model-2-response.json 408B
  1012. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/._gpt4-model-2-response.json 212B
  1013. llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/correlation-analysis.ipynb 33.8KB
  1014. __MACOSX/llms-from-scratch-cn-main/Codes/ch07/03_model-evaluation/scores/._correlation-analysis.ipynb 212B
  1015. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/overview-after-ln.webp 20.73KB
  1016. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._overview-after-ln.webp 212B
  1017. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/gpt.webp 29.85KB
  1018. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._gpt.webp 212B
  1019. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/mental-model-final.webp 20.98KB
  1020. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._mental-model-final.webp 212B
  1021. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/use-gpt.webp 14.97KB
  1022. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._use-gpt.webp 212B
  1023. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/shortcut-example.webp 32.1KB
  1024. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._shortcut-example.webp 212B
  1025. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/mental-model.webp 25.04KB
  1026. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._mental-model.webp 212B
  1027. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/iterative-generate.webp 23.72KB
  1028. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._iterative-generate.webp 212B
  1029. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/chapter-steps.webp 29.38KB
  1030. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._chapter-steps.webp 212B
  1031. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/layernorm2.webp 13.96KB
  1032. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._layernorm2.webp 212B
  1033. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/generate-text.webp 36.26KB
  1034. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._generate-text.webp 212B
  1035. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/transformer-block.webp 25.86KB
  1036. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._transformer-block.webp 212B
  1037. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/mental-model-2.webp 14.87KB
  1038. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._mental-model-2.webp 212B
  1039. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/mental-model-3.webp 21.01KB
  1040. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._mental-model-3.webp 212B
  1041. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/iterative-gen.webp 17.92KB
  1042. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._iterative-gen.webp 212B
  1043. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/gpt-in-out.webp 20.97KB
  1044. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._gpt-in-out.webp 212B
  1045. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/layernorm.webp 26.98KB
  1046. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._layernorm.webp 212B
  1047. llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/ffn.webp 24.34KB
  1048. __MACOSX/llms-from-scratch-cn-main/Codes/ch04/01_main-chapter-code/figures/._ffn.webp 212B
  1049. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/weight-selfattn-3.png 53.18KB
  1050. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._weight-selfattn-3.png 212B
  1051. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/weight-selfattn-2.png 60.61KB
  1052. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._weight-selfattn-2.png 212B
  1053. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/dot-product.png 93.4KB
  1054. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._dot-product.png 212B
  1055. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/weight-selfattn-1.png 52.21KB
  1056. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._weight-selfattn-1.png 212B
  1057. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/weight-selfattn-4.png 53.85KB
  1058. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._weight-selfattn-4.png 212B
  1059. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/attention.png 66.62KB
  1060. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._attention.png 212B
  1061. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/masked.png 59.09KB
  1062. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._masked.png 212B
  1063. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/single-head.png 71.34KB
  1064. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._single-head.png 212B
  1065. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/multi-head.png 59.77KB
  1066. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._multi-head.png 212B
  1067. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/attention-matrix.png 136.29KB
  1068. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._attention-matrix.png 212B
  1069. llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/dropout.png 62.86KB
  1070. __MACOSX/llms-from-scratch-cn-main/Codes/ch03/01_main-chapter-code/figures/._dropout.png 212B
  1071. llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/gpt2_model/encoder.json 1017.87KB
  1072. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/02_bonus_bytepair-encoder/gpt2_model/._encoder.json 212B
  1073. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/4.png 290.55KB
  1074. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/._4.png 212B
  1075. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/5.png 288.61KB
  1076. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/._5.png 212B
  1077. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/2.png 132.57KB
  1078. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/._2.png 212B
  1079. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/3.png 216.44KB
  1080. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/._3.png 212B
  1081. llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/1.png 133.33KB
  1082. __MACOSX/llms-from-scratch-cn-main/Codes/ch02/03_bonus_embedding-vs-matmul/images/._1.png 212B
  1083. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/img-1.webp 86.94KB
  1084. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/._img-1.webp 212B
  1085. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/img-3.webp 58.68KB
  1086. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/._img-3.webp 212B
  1087. llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/img-2.webp 72.46KB
  1088. __MACOSX/llms-from-scratch-cn-main/Codes/ch05/01_main-chapter-code/images/._img-2.webp 212B
  1089. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/download.png 174.07KB
  1090. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._download.png 212B
  1091. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/pytorch-installer.jpg 94.51KB
  1092. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._pytorch-installer.jpg 212B
  1093. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/new-env.png 185.38KB
  1094. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._new-env.png 212B
  1095. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/miniforge-install.png 258.47KB
  1096. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._miniforge-install.png 212B
  1097. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/check-pip.png 219.68KB
  1098. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._check-pip.png 212B
  1099. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/conda-install.png 186.52KB
  1100. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._conda-install.png 212B
  1101. llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/activate-env.png 180KB
  1102. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/01_optional-python-setup-preferences/figures/._activate-env.png 212B
  1103. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/watermark.jpg 35.99KB
  1104. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/._watermark.jpg 212B
  1105. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/pytorch-installer.jpg 94.51KB
  1106. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/._pytorch-installer.jpg 212B
  1107. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/jupyter-issues.jpg 102.72KB
  1108. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/._jupyter-issues.jpg 212B
  1109. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/check_2.jpg 78.97KB
  1110. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/._check_2.jpg 212B
  1111. llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/check_1.jpg 107.24KB
  1112. __MACOSX/llms-from-scratch-cn-main/Codes/appendix-A/02_installing-python-libraries/figures/._check_1.jpg 212B

资源介绍:

如果你想从0手写代码,构建大语言模型,本项目很适合你。 本项目 "LLMs From Scratch" 是由 Datawhale 提供的一个从头开始构建类似 ChatGPT 大型语言模型(LLM)的实践教程。 我们旨在通过详细的指导、代码示例和深度学习资源,帮助开发者和研究者掌握创建大语言模型和大语言模型架构的核心技术。 本项目包括了从0逐步构建GLM4\Llama3\RWKV6的教程,从0构建大模型,一起深入理解大模型原理。
# 从头开始实现llama3 在这个文件中,我逐个张量和矩阵地从头实现了llama3。 本地可以运行:llama3-from-scratch.ipynb
此外,我将直接从meta提供给llama3的模型文件中加载张量,你需要在运行此文件之前下载权重。 这是下载权重的官方链接: [点击这里下载权重](https://llama.meta.com/llama-downloads/)
https://hf-mirror.com/NousResearch/Meta-Llama-3-8B https://gitee.com/hf-models/Meta-Llama-3-8B-Instruct/ ## 分词器 我不打算实现一个BPE分词器(但是Andrej Karpathy有一个非常干净的实现)。
他的实现链接: [点击这里查看他的实现](https://github.com/karpathy/minbpe)
```python %env HF_ENDPOINT = "https://hf-mirror.com" ``` env: HF_ENDPOINT="https://hf-mirror.com" ```python %pip install blobfile -q ``` Note: you may need to restart the kernel to use updated packages. ```python from pathlib import Path import tiktoken from tiktoken.load import load_tiktoken_bpe import torch import json import matplotlib.pyplot as plt tokenizer_path = "./tokenizer.model" special_tokens = [ "<|begin_of_text|>", "<|end_of_text|>", "<|reserved_special_token_0|>", "<|reserved_special_token_1|>", "<|reserved_special_token_2|>", "<|reserved_special_token_3|>", "<|start_header_id|>", "<|end_header_id|>", "<|reserved_special_token_4|>", "<|eot_id|>", # end of turn ] + [f"<|reserved_special_token_{i}|>" for i in range(5, 256 - 5)] mergeable_ranks = load_tiktoken_bpe(tokenizer_path) tokenizer = tiktoken.Encoding( name=Path(tokenizer_path).name, pat_str=r"(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+(?!\S)|\s+", mergeable_ranks=mergeable_ranks, special_tokens={token: len(mergeable_ranks) + i for i, token in enumerate(special_tokens)}, ) tokenizer.decode(tokenizer.encode("hello world!")) ``` 'hello world!' ## 读取模型文件 通常,读取模型文件取决于模型类的编写方式以及其中的变量名。
但由于我们是从头开始实现llama3,我们将逐个张量地读取文件。
可以在这里下载模型:https://gitee.com/hf-models/Meta-Llama-3-8B-Instruct/blob/main/original/consolidated.00.pth ```python !wget 'https://lfs.gitee.com/api/lfs/storage/projects/34266234/be52262c9289304f3e8240e0749bf257bc04264405a86cd4de38efb9068724ee?Expires=1716626632&Signature=xgDOu9JHNM6ECazR3nA4NQHwXs%2BiG%2BCtnzza6ekSuqs%3D&FileName=consolidated.00.pth' ``` --2024-05-25 16:24:15-- https://lfs.gitee.com/api/lfs/storage/projects/34266234/be52262c9289304f3e8240e0749bf257bc04264405a86cd4de38efb9068724ee?Expires=1716626632&Signature=xgDOu9JHNM6ECazR3nA4NQHwXs%2BiG%2BCtnzza6ekSuqs%3D&FileName=consolidated.00.pth Resolving lfs.gitee.com (lfs.gitee.com)... 180.76.198.180 Connecting to lfs.gitee.com (lfs.gitee.com)|180.76.198.180|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 16060617592 (15G) [application/octet-stream] Saving to: ‘be52262c9289304f3e8240e0749bf257bc04264405a86cd4de38efb9068724ee?Expires=1716626632&Signature=xgDOu9JHNM6ECazR3nA4NQHwXs+iG+Ctnzza6ekSuqs=&FileName=consolidated.00.pth’ 0% [ ] 105,193,134 453KB/s eta 11h 21m^C 我的机器12s可以载入,接下来仅用cpu进行推理,我这边内存30G足够了,然后cpu推理一个词大约30s,稍微慢了一些,不过我们主要理解原理 ```python model = torch.load("/data1/ckw/consolidated.00.pth") print(json.dumps(list(model.keys())[:20], indent=4)) ``` [ "tok_embeddings.weight", "layers.0.attention.wq.weight", "layers.0.attention.wk.weight", "layers.0.attention.wv.weight", "layers.0.attention.wo.weight", "layers.0.feed_forward.w1.weight", "layers.0.feed_forward.w3.weight", "layers.0.feed_forward.w2.weight", "layers.0.attention_norm.weight", "layers.0.ffn_norm.weight", "layers.1.attention.wq.weight", "layers.1.attention.wk.weight", "layers.1.attention.wv.weight", "layers.1.attention.wo.weight", "layers.1.feed_forward.w1.weight", "layers.1.feed_forward.w3.weight", "layers.1.feed_forward.w2.weight", "layers.1.attention_norm.weight", "layers.1.ffn_norm.weight", "layers.2.attention.wq.weight" ] ```python with open("./params.json", "r") as f: config = json.load(f) config ``` {'dim': 4096, 'n_layers': 32, 'n_heads': 32, 'n_kv_heads': 8, 'vocab_size': 128256, 'multiple_of': 1024, 'ffn_dim_multiplier': 1.3, 'norm_eps': 1e-05, 'rope_theta': 500000.0} ## 我们使用这个配置来推断模型的细节,比如: 1. 模型有32个Transformer层 2. 每个多头注意力块有32个头 3. 词汇表大小,等等 ```python dim = config["dim"] n_layers = config["n_layers"] n_heads = config["n_heads"] n_kv_heads = config["n_kv_heads"] vocab_size = config["vocab_size"] multiple_of = config["multiple_of"] ffn_dim_multiplier = config["ffn_dim_multiplier"] norm_eps = config["norm_eps"] rope_theta = torch.tensor(config["rope_theta"]) ``` ## 将文本转换为标记 这里我们使用tiktoken(我认为是OpenAI的一个库)作为分词器
```python prompt = "the answer to the ultimate question of life, the universe, and everything is " tokens = [128000] + tokenizer.encode(prompt) print(tokens) tokens = torch.tensor(tokens) prompt_split_as_tokens = [tokenizer.decode([token.item()]) for token in tokens] print(prompt_split_as_tokens) ``` [128000, 1820, 4320, 311, 279, 17139, 3488, 315, 2324, 11, 279, 15861, 11, 323, 4395, 374, 220] ['<|begin_of_text|>', 'the', ' answer', ' to', ' the', ' ultimate', ' question', ' of', ' life', ',', ' the', ' universe', ',', ' and', ' everything', ' is', ' '] ## 将标记转换为它们的嵌入向量 这是代码库中我唯一使用内置神经网络模块的部分。
无论如何,我们的[17x1]标记现在是[17x4096],即长度为4096的17个嵌入向量(每个标记一个)。

注意: 跟踪形状,这样可以更容易理解所有内容
```python embedding_layer = torch.nn.Embedding(vocab_size, dim) embedding_layer.weight.data.copy_(model["tok_embeddings.weight"]) token_embeddings_unnormalized = embedding_layer(tokens).to(torch.bfloat16) token_embeddings_unnormalized.shape ``` torch.Size([17, 4096]) ## 然后我们使用RMS归一化来标准化嵌入向量 请注意,在此步骤之后,形状不会改变,只是值被标准化了。
需要记住的一些事情,我们需要一个norm_eps(来自配置),因为我们不希望意外地将RMS设置为0并除以0。
以下是公式:
```python # def rms_norm(tensor, norm_weights): # rms = (tensor.pow(2).mean(-1, keepdim=True) + norm_eps)**0.5 # return tensor * (norm_weights / rms) def rms_norm(tensor, norm_weights): return (tensor * torch.rsqrt(tensor.pow(2).mean(-1, keepdim=True) + norm_eps)) * norm_weights ``` # 构建Transformer的第一层 ### 标准化 你会看到我从模型字典中访问layer.0(这是第一层)。
无论如何,所以在我们标准化后,形状仍然是[17x4096],与嵌入向量相同,但是标准化了
```python token_embeddings = rms_norm(token_embeddings_unnormalized, model["layers.0.attention_norm.weight"]) to
100+评论
captcha