我想要一天分享一點「LLM從底層堆疊的技術」，並且每篇文章長度控制在三分鐘以內，讓大家不會壓力太大，但是又能夠每天成長一點。



從 AI說書 - 從0開始 - 82 到 AI說書 - 從0開始 - 85 的說明，有一個很重要的結論：最適合您的模型不一定是排行榜上最好的模型，您需要學習 NLP 評

<html lang="en"><head><style>              .article-container {                width: 100%;                font-family: Microsoft JhengHei,Helvetica Neue,Helvetica,Arial,sans-serif;              }              ul, ol {                margin: 12px auto;                max-width: 740px;                color: #535150;                line-height: 1.8;                padding-left: 0px;              }              .graf--img {                display: table;                justify-content: center;                align-items: center;                text-align: center;                color: gray;                font-size: 14px;                letter-spacing: 0px;                margin: 10px auto 50px;                width: 100%;                position: relative;                clear: both;              }              .graf--img.center img {                width: 100%;                max-width: 740px;                margin: 10px auto 0px;                display: block;                margin: 0 auto;              }              .graf--img.full img {                width: 100%;              }              .captionTheme__wrapper {                width: 100%;                font-style: normal;                line-height: 22px;                font-size: 16px;                max-width: 600px;                margin-top: 8px;                display: inline-block;              }              .graf--img.full {                max-width: 100%;                margin: 40px 0px;                display: block;                margin: 0 auto;                align-items: center;              }              .graf--figure {                text-align: center;                color: gray;                font-style: italic;                font-size: 15px;                margin: 28px auto;                box-sizing: border-box;              }              .graf--figure iframe {                width: 100%;                max-width: 740px;                margin: 0 auto;              }              .graf--p {                font-size: 16px;                line-height: 1.8;                font-family: "Microsoft JhengHei fixed", "Helvetica Neue" ,"Microsoft JhengHei", Helvetica, "Segoe UI", Tahoma, Arial, sans-serif;                letter-spacing: 1px;                font-weight: 400;                max-width: 740px;                color: #535150;                text-align: left;              }              .graf--p > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              .graf--li > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              .graf--quotesSpecial > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              .graf--blockquote > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              .graf--h1 > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              .graf--h2 > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              .graf--h3 > a {                color: #00B3C6 !important;                text-decoration: none !important;              }              a.graf--mention {                color: #535150 !important;                text-decoration: underline !important;                font-weight: 700;              }              .graf--h2 {                font-size: 24px;                padding: 0;                max-width: 740px;                text-align: left;                letter-spacing: 1px;                font-weight: 700;                margin-top: 34px;                line-height: 1.5;              }              .graf--h3 {                font-size: 18px;                padding: 0;                max-width: 740px;                text-align: left;                letter-spacing: 1px;                font-weight: 700;                margin-top: 28px;                line-height: 1.5;              }              .graf--li {                font-size: 16px;                padding: 0px 0px 0px 4px;                font-weight: 400;                letter-spacing: 0px;                list-style-position: outside;                text-align:left;                margin-left: 24px;              }              .graf--hr {                width: 100%;                margin: 0px auto;                transform: translateY(-50%);                position: relative;                padding: 0px;                text-align: left;                max-width: 740px;                margin: 0 auto;              }              .graf--hr hr {                height: 0;              }              .graf--blockquote {                padding: 10px 0px 10px 16px;                font-size: 16px;                color: #7A7574;                letter-spacing: 1px;                margin: 28px 0px;                border-left: 4px solid #DDD9D8;                width: 100%;                max-width: 740px;                text-align: left;              }              .graf--quotesSpecial {                display: table;                color: #7A7574;                position: relative;                padding: 31.5px 40px;                text-align: center;                letter-spacing: 0px;                position: relative;                margin: 29px auto;                font-family: "Microsoft JhengHei fixed", "Helvetica Neue", "Microsoft JhengHei", Helvetica, "Segoe UI", Tahoma, Arial, sans-serif;                font-size: 16px;                -webkit-box-ordinal-group: 1;                -webkit-box-flex: 0;              }              .embed-wrapper {                max-width: 740px;                border: 1px solid #DDD9D8;                display: block;                padding: 12px;                border-radius: 8px;                margin: 12px 0px;                text-decoration: none !important;              }              .embed-title {                font-size: 16px;                font-weight: 700;                color: #535150;                margin-bottom: 8px;                text-align: left;                line-height: 1.5;                text-decoration: none !important;              }              .embed-description {                width: 100%;                font-size: 14px;                color: #7A7574;                line-height: 1.5;                max-height: 150px;                text-align: left;                overflow: hidden;                padding: 12px 0px;              }              .embed-url > a {                width: 100%;                font-size: 14px;                color: #141413 !important;                text-decoration: none !important;                line-height: 1.5;                text-align: left;              }                            .embed-thumbnail-wrapper {                padding-left: 12px;              }              .embed-thumbnail {                width:100px;                border-radius: 8px;              }              pre {                background: #F6F6F6;                border-radius: 8px;                padding: 16px;                font-size: 16px;                color: #535150;                line-height: 180%;                text-align: left;              }              .lexical__textBold {                font-weight: bold;              }              .lexical__textItalic {                font-style: italic;              }              .lexical__textUnderline {                text-decoration: underline;              }              .lexical__textStrikethrough {                text-decoration: line-through;              }              .lexical__textUnderlineStrikethrough {                text-decoration: underline line-through;              }              .lexical__textSubscript {                font-size: 0.8em;                vertical-align: sub;              }              .lexical__textSuperscript {                font-size: 0.8em;                vertical-align: super;              }              .lexical__textCode {                background-color: rgb(240, 242, 245);                padding: 1px 0.25rem;                font-family: Menlo, Consolas, Monaco, monospace;                font-size: 94%;              }            </style></head><body><div class="article-container"><p class="graf--p" dir="ltr"><span style="white-space: pre-wrap;">我想要一天分享一點「LLM從底層堆疊的技術」，並且每篇文章長度控制在三分鐘以內，讓大家不會壓力太大，但是又能夠每天成長一點。</span></p><p class="graf--p" dir="ltr"><br></p><p class="graf--p" dir="ltr"><span style="white-space: pre-wrap;">從 </span><a href="https://vocus.cc/article/668d486afd89780001eee47d" target="_blank"><span style="white-space: pre-wrap;">AI說書 - 從0開始 - 82</span></a><span style="white-space: pre-wrap;"> 到 </span><a href="https://vocus.cc/article/6690cd65fd89780001c35b12" target="_blank"><span style="white-space: pre-wrap;">AI說書 - 從0開始 - 85</span></a><span style="white-space: pre-wrap;"> 的說明，有一個很重要的結論：</span><b><strong class="lexical__textBold" style="white-space: pre-wrap;">最適合您的模型不一定是排行榜上最好的模型，您需要學習 NLP 評估流程並將其應用到您選擇實施的模型中</strong></b><span style="white-space: pre-wrap;">。</span></p><p class="graf--p" dir="ltr"><br></p><p class="graf--p" dir="ltr"><span style="white-space: pre-wrap;">有鑑於此，有必要學習一下評估流程 (Evaluation Process) 是怎麼回事。</span></p><p class="graf--p" dir="ltr"><br></p><p class="graf--p" dir="ltr"><span style="white-space: pre-wrap;">Wang 等人於 2019 為他們的 SuperGLUE Benchmark 選擇了 NLP 的實際代表性任務，這些任務的選擇標準比 GLUE 更嚴格，例如，任務不僅必須理解文本，還必須理解推理 (Reason)，推理水平還不是人類頂尖專家的水平，然而，性能水準足以取代許多人工任務。</span></p><p class="graf--p" dir="ltr"><br></p><p class="graf--p" dir="ltr"><span style="white-space: pre-wrap;">主要的 SuperGLUE 任務顯示在 </span><a href="https://" rel="noreferrer"><span style="white-space: pre-wrap;">https://super.gluebenchmark.com/tasks</span></a><span style="white-space: pre-wrap;">，如下所示：</span></p><div class="graf--img center"><div class="lexical__imageWrapper"><img src="https://images.vocus.cc/124ba388-44da-4b9e-adfb-c3ceb43b6c8e.png" data-src="https://d2a6d2ofes041u.cloudfront.net/resize?compression=6&norotation=true&url=https%3A%2F%2Fimages.vocus.cc%2F124ba388-44da-4b9e-adfb-c3ceb43b6c8e.png&width=740&sign=73sOiJaNUB0SKopKcY93fCkNjJyiCf59G5KR_aG2k4I" class="lazy" data-original-src="https://images.vocus.cc/124ba388-44da-4b9e-adfb-c3ceb43b6c8e.png" data-lowquality="false" data-width="1632" data-height="1212" alt="圖片出自書籍：Transformers for Natural Language Processing and Computer Vision, 2024"></div><div class="captionTheme__wrapper"><p class="captionTheme__paragraph">圖片出自書籍：Transformers for Natural Language Processing and Computer Vision, 2024</p></div></div><p class="graf--p" dir="ltr"><br></p></div></body></html>

以行動支持創作者！付費即可解鎖

贊助珍奶

學習

AI說書 - 從0開始 - 87

AI說書 - 從0開始 - 85

前言

讀了許多理論，是時候實際動手做做看了，以下是我的模型訓練初體驗，有點糟就是了XD。



正文

def conv(filters, kernel_size, strides=1):
	return Conv2D(filters, 
									kernel_size, 
						

柴郡貓姍蒂的沙龍

筆記-深度學習模型訓練：利用殘差網路做影像辨識

最新的AI趨勢讓人眼花撩亂，不知要如何開始學習？本文介紹了作者對AI的使用和體驗，以及各類AI工具以及推薦的選擇。最後強調了AI是一個很好用的工具，可以幫助人們節省時間並提高效率。鼓勵人們保持好奇心，不停止學習，並提出了對健康生活和開心生活的祝福。

以前很愛看書的我，終於發現自己很久沒看書了。2024年設定一個每月讀二本書和寫二篇讀後心得的目標，重新學習寫作，後經友人推薦來方格子開房間。我的部落格從2000年左右開始寫，從PC Home電子報、無名小站、Blogger、到現在的WordPress，單純記錄日常生活點滴。www.SabrinaHuang.com

莎姐的矽谷茶棧

你開始使用AI了嗎？

筆記-曲博談AI模型.群聯-24.05.05

https://www.youtube.com/watch?v=JHE88hwx4b0&t=2034s

*大型語言模型 三個步驟:

1.預訓練，訓練一次要用幾萬顆處理器、訓練時間要1個月，ChatGPT訓練一次的成本為1000萬美金。

2.微調(

每日發車

筆記-曲博談AI模型.群聯-24.05.05

AI 相關的內容每天都非常多，有聽過很多人因此感覺到焦慮，怕錯過了最新資訊就會趕不上，這篇內容會跟大家詳細的分享我自己的學習方法和經驗，並且會在最後分享一些我的學習資訊來源。

創作邦致力分享設計新知、創作工具、高效工作方法，我們的沙龍提供各種給設計師和創作者的實用知識與資源，如果你付費訂閱我們，還會提供你更深度的內容分享、專屬討論區、會員購買數位商品限定優惠等福利。

創作邦｜設計X工具X品牌的沙龍

我如何從零開始接觸與學習 AI，超詳細學習方法與心得

職場

親子與教育

軟體開發

https://www.youtube.com/watch?v=wjZofJX0v4M

這是我看過最好的AI科普影片了；現在流行的GPT使用的大語言模型 (large language model, LLM), 是把每一個單字都當作一個高維度向量 

影片中GPT3共儲存50257個英文單字, 每

電影戲劇

國際

科技

白話詹的沙龍

淺聊AI

閱讀書評

<div class="draft-block draft--p left">謝謝分享，看了這篇文章讓我想要認真研究一番了。</div>

下午茶

創作

投資理財

請支援收銀

創作邦訂閱精選文章

大語言模型能夠生成文本，因此被認為是生成式人工智慧的一種形式。



人工智慧的學科任務，是製作機器，使其能執行需要人類智慧才能執行的任務，例如理解語言，便是模式，做出決策。



除了大語言模型，人工智慧也包含了深度學習以及機器學習。



機器學習的學科任務，是透過演算法來實踐AI。



特別

<div class="draft-block draft--p left">有時候還很依賴ai（看見滿滿的公文）</div>

LLM 筆記

王啟樺的沙龍

LLM 003｜人工智慧如何從數據中學習？

這陣子使用AI模型，還有參考國內外一些喜歡玩語言模型的同好發文，一個很有趣的結論就是，有時候把大型語言模型(尤其ChatGPT)當作一個人來溝通，會得到比較好的結果，這的確是非常反直覺的，也就是說很多時候ChatGPT耍懶不肯工作的時候，你用加油打氣，或是情緒勒索的方法，確實是可以得到比較好的結果。

技術PM的AI實驗室

技術PM的AI實驗室，是以輕鬆的角度深入簡出的探討各種生成式AI工具的使用。無論你是想理解AI到底是怎麼運作的? 想知道有那些好用的生成式AI工具? 或者是對AI繪圖有興趣的，都歡迎加入我們的AI實驗室一起輕鬆地玩耍，我們邊玩邊學，學習跟AI一起共創新的可能。

技術PM路易斯的沙龍

情緒勒索你的AI來得到最佳的結果

看了這個視頻， 
更確定我們的投資展望！ 
這是一個很棒的視頻， 
一起學習新知識。😀

他把AI說得讓我這種不明白的人也很容易懂， 
還有人生不同層面， 
都很棒🙏

 https://youtu.be/qyOCtY1E4Qk?si=W5vXDjqsFzykvK3z 



凱特的貴婦之路

身為一個媽媽，最重要的是給孩子一個正確的人生指引．
凱特出生於平凡家庭，期許自己能成為一個真誠善良，並且獨立傲然的女人．
成功是一種觀念，致富是一種義務，快樂是一種權力．
我曾以為自由是可以隨心所欲，後來才發現自由是不必身不由己．
希望藉由自己的文章，讓更多的朋友能夠成為自己想當的人．　

凱特的貴婦之路的沙龍

AI可以做什麼?

延續上週提到的，「有哪些不訓練模型的情況下，能夠強化語言模型的能力」，這堂課接續介紹其中第 3、4 個方法

<div class="draft-block draft--p left">話說Chatgpt 的DALLE 的限制是真的多🥲</div>

Enjoy sharing | 享受分享 | 日常 x 學習 x 閱讀
初衷是把生活中所學與大家分享
也歡迎一起來進行🤗

ezra.share.injoy

學習筆記【生成式AI導論 2024】第4講：訓練不了人工智慧？你可以訓練你自己 (中) — 拆解問題與使用工具

AI從0開始-第三章

三分鐘學AI

這裡將提供： AI、Machine Learning、Deep Learning、Reinforcement Learning、Probabilistic Graphical Model的讀書筆記與演算法介紹，一起在未來AI的世界擁抱AI技術，不BI。

Learn AI 不 BI

圓圓回憶錄

AI從0開始-第一章

AI從0開始-第二章

AI馴獸師-第零章

AI馴獸師-第一章

AI從0開始-第四章

AI馴獸師-第二章

AI從0開始-第五章

AI從0開始-第六章

AI從0開始-第七章

AI馴獸師-第三章

AI從0開始-第八章

AI從0開始-第九章

AI馴獸師-第四章

AI從0開始-第十章

AI從0開始-十一章

AI馴獸師-第五章

AI從0開始-十二章

AI從0開始-十三章

AI馴獸師-第六章

三分鐘學AI (2)

證照相關

自然語言處理相關

機率圖模型

AI從0開始-十四章

AI馴獸師-第七章

AI從0開始-十五章

AI從0開始-十六章

AI馴獸師-第八章

AI從0開始-十七章

AI從0開始-十八章

AI從0開始-十九章

AI馴獸師-第九章

AI從0開始-二十章

三分鐘學AI (3)

AI馴獸師-第十章

AI馴獸師-第十一章

AI馴獸師-第十二章

AI馴獸師-第十三章

AI馴獸師-第十四章

AI馴獸師-第十五章

AI馴獸師-第十六章

AI馴獸師-第十七章

AI馴獸師-第十八章

AI馴獸師-第十九章

三分鐘學AI (4)

AI馴獸師-第二十章

AI馴獸師-二十一章

AI馴獸師-二十二章

AI馴獸師-二十三章

AI馴獸師-二十四章

AI馴獸師-二十五章

三分鐘學AI (5)

AI說書 - 從0開始 - 86

<p class="lexical__paragraph" dir="ltr"><span style="white-space: pre-wrap;">我想要一天分享一點「LLM從底層堆疊的技術」，並且每篇文章長度控制在三分鐘以內，讓大家不會壓力太大，但是又能夠每天成長一點。</span></p><p class="lexical__paragraph" dir="ltr"><br></p><p class="lexical__paragraph" dir="ltr"><div class="ad-placeholder" style="min-height: 124px;"></div><span style="white-space: pre-wrap;">從 </span><a href="https://vocus.cc/article/668d486afd89780001eee47d" target="_blank"><span style="white-space: pre-wrap;">AI說書 - 從0開始 - 82</span></a><span style="white-space: pre-wrap;"> 到 </span><a href="https://vocus.cc/article/6690cd65fd89780001c35b12" target="_blank"><span style="white-space: pre-wrap;">AI說書 - 從0開始 - 85</span></a><span style="white-space: pre-wrap;"> 的說明，有一個很重要的結論：</span><b><strong class="lexical__textBold" style="white-space: pre-wrap;">最適合您的模型不一定是排行榜上最好的模型，您需要學習 NLP 評估流程並將其應用到您選擇實施的模型中</strong></b><span style="white-space: pre-wrap;">。</span></p><p class="lexical__paragraph" dir="ltr"><br></p><p class="lexical__paragraph" dir="ltr"><span style="white-space: pre-wrap;">有鑑於此，有必要學習一下評估流程 (Evaluation Process) 是怎麼回事。</span></p><p class="lexical__paragraph" dir="ltr"><br></p><p class="lexical__paragraph" dir="ltr"><span style="white-space: pre-wrap;">Wang 等人於 2019 為他們的 SuperGLUE Benchmark 選擇了 NLP 的實際代表性任務，這些任務的選擇標準比 GLUE 更嚴格，例如，任務不僅必須理解文本，還必須理解推理 (Reason)，推理水平還不是人類頂尖專家的水平，然而，性能水準足以取代許多人工任務。</span></p><p class="lexical__paragraph" dir="ltr"><br></p><p class="lexical__paragraph" dir="ltr"><span style="white-space: pre-wrap;">主要的 SuperGLUE 任務顯示在 </span><a href="https://" rel="noreferrer"><span style="white-space: pre-wrap;">https://super.gluebenchmark.com/tasks</span></a><span style="white-space: pre-wrap;">，如下所示：</span></p><div class="lexical__image center"><div class="lexical__imageWrapper"><img src="https://resize-image.vocus.cc/resize?compression=6&amp;norotation=true&amp;url=https%3A%2F%2Fimages.vocus.cc%2F124ba388-44da-4b9e-adfb-c3ceb43b6c8e.png&amp;width=740&amp;sign=73sOiJaNUB0SKopKcY93fCkNjJyiCf59G5KR_aG2k4I" fetchpriority="high" data-src="https://resize-image.vocus.cc/resize?compression=6&amp;norotation=true&amp;url=https%3A%2F%2Fimages.vocus.cc%2F124ba388-44da-4b9e-adfb-c3ceb43b6c8e.png&amp;width=740&amp;sign=73sOiJaNUB0SKopKcY93fCkNjJyiCf59G5KR_aG2k4I" data-loaded="true" data-original-src="https://images.vocus.cc/124ba388-44da-4b9e-adfb-c3ceb43b6c8e.png" data-lowquality="false" data-width="1632" data-height="1212" data-retry="0" onerror="Number(this.dataset.retry) > 4 ? this.src='/static/default-error-img.svg': (() => {this.src=this.dataset.originalSrc; this.dataset.retry = Number(this.dataset.retry)+1;})()" data-istopthreeimage="true" alt="圖片出自書籍：Transformers for Natural Language Processing and Computer Vision, 2024"></div><div class="captionTheme__wrapper"><p class="captionTheme__paragraph">圖片出自書籍：Transformers for Natural Language Processing and Computer Vision, 2024</p></div></div><p class="lexical__paragraph" dir="ltr"><br></p>

<div class="draft-block draft--p left">
              <a href="/user/@65afe55afd8978000172eb88" target="_blank" class="draft--mention">
                <i class="icon-mention"></i>
                <span>凱特</span>
              </a>
             小心上癮喔！ＬＯＬ</div>

<div class="draft-block draft--p left">
              <a href="/user/@successwritercylin" target="_blank" class="draft--mention">
                <i class="icon-mention"></i>
                <span>媗日</span>
              </a>
             歐！真的是實際用才知道的細節呢~</div>