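I want to share a little each day about "the technology behind LLMs, built up from the bottom layer." Each post is kept to a reading time of under three minutes, so that it never feels like too much pressure, yet everyone can still grow a little every day.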
References:
- Barham et al., 2022, Pathways: Asynchronous Distributed Dataflow for ML: https://arxiv.org/abs/2203.12533
- Chowdhery et al., 2022, PaLM: Scaling Language Modeling with Pathways: https://arxiv.org/pdf/2204.02311.pdf
- Anil et al., 2023, PaLM 2 Technical Report: https://arxiv.org/abs/2305.10403
- Google Cloud Vertex AI documentation: https://cloud.google.com/vertex-ai/docs/generative-ai/learn/overview
Further reading:
- Generative AI repository (Google): https://github.com/GoogleCloudPlatform/generative-ai/#readme
- Roberto Gozalo-Brizuela and Eduardo C. Garrido-Merchán, 2023, A survey of Generative AI Applications: https://arxiv.org/abs/2306.02781