Timestamp
00:00 Intro
00:21 MoT
03:07 JudgeLM
05:52 Emulator for Fine-Tuned LLMs
08:19 ConvNet matches ViT
09:57 Zephyr Technical Paper
12:08 Text to NeRF
14:04 KITAB
16:06 AlpaGasus
17:43 Fine-Tuning Jailbreaks
19:27 Vicuna Radiology
21:04 QMoE
32:38 Zephyr Model
32:38 Marker 1
25:12 Narwhal Mistral Model Merges
26:45 GradioLite
28:14 RLMRec
31:56 OpenAI Prepardness
34:25 Poe Creator Monetization
35:10 Draft Ai Executive Order
Links at the bottom section!!!
If you want to support the channel
Support here:
Patreon – https://www.patreon.com/1littlecoder/
Ko-Fi – https://ko-fi.com/1littlecoder
Follow me on
Twitter – https://twitter.com/1littlecoder
Linkedin – https://www.linkedin.com/in/amrrs/
Links
Papers
Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation – https://arxiv.org/pdf/2310.15961.pdf
JudgeLM – https://arxiv.org/pdf/2310.17631.pdf
An Emulator for Fine-Tuning Large Language Models using Small Language Models
https://arxiv.org/pdf/2310.12962.pdf
ConvNets Match Vision Transformers at Scale https://arxiv.org/pdf/2310.16764.pdf
QMoE – https://arxiv.org/pdf/2310.16795.pdf
ZEPHYR: DIRECT DISTILLATION OF LM ALIGNMENT https://arxiv.org/pdf/2310.16944.pdf
HYPERFIELDS: TOWARDS ZERO-SHOT GENERATION OF NERFS FROM TEXT https://arxiv.org/pdf/2310.17075.pdf
KITAB – https://arxiv.org/pdf/2310.15511.pdf
Fine-tune LLMs for harmful response https://arxiv.org/abs/2310.03693 https://arxiv.org/pdf/2310.03693.pdf
Feasibility of Using the Privacy-preserving Large Language Model Vicuna for Labeling Radiology Reports https://pubs.rsna.org/doi/epdf/10.1148/radiol.231147
Models
Zephyr Beta https://huggingface.co/HuggingFaceH4/zephyr-7b-beta
Eric Hartford’s Dolphin2.1 with MetaMath-Mistral by Meta Math. Then that model was merged with and HuggingFace’s Zephyr-7b-beta https://huggingface.co/Vezora/Mistral-Narwhal-7b https://huggingface.co/Vezora/Mistral-Narwhal-7b-v2
Open Source
Serverless Gradio (Gradio Lite) – https://www.gradio.app/guides/gradio-lite
RLMRec: Representation Learning with Large Language Models for Recommendation https://github.com/hkuds/rlmrec#rlmrec-representation-learning-with-large-language-models-for-recommendation
OpenAgents – https://github.com/xlang-ai/OpenAgents
General News
Frontier Risks and OpenAI preparedness Challenge https://openai.com/blog/frontier-risk-and-preparedness
Poe Creator Monetization Program https://quorablog.quora.com/Introducing-creator-monetization-for-Poe