Radio 4 for woke people.
…
continue reading
"Writing It! The Podcast About Academics & Writing" dives deep into the world of academic writing and publishing. Join us for conversations with academics and editors as we discuss challenges, strategies, and insights from our writing lives. As we share our experiences and helpful hacks, we make the process of writing and getting published a bit more transparent and a bit less overwhelming.
…
continue reading
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers Support this podcast: https://podcasters.spotify.com/pod/s ...
…
continue reading
Welcome to the Data Science Conversations Podcast hosted by Damien Deighan and Dr Philipp Diesinger. We bring you interesting conversations with the world’s leading Academics working on cutting edge topics with potential for real world impact. We explore how their latest research in Data Science and AI could scale into broader industry applications, so you can expand your knowledge and grow your career. Every 4 or 5 episodes we will feature an industry trailblazer from a strong academic back ...
…
continue reading
A monthly conversation about books and ideas on NTS Radio hosted by friends Carrie Plitt, a literary agent, and Octavia Bright, a writer and academic. Each show features an author interview, book recommendations, lively discussion and a little music too, all built around a related theme - anything from the novella to race to masculinity. Listen live on NTS Radio www.nts.live
…
continue reading
Agentic schools are extraordinary places where children have real power to determine the direction of their education. These are schools that recognize the academic curriculum as being less important than the hidden social curriculum. Your host Don Berg interviews the people who make the magic of agentic schooling happen.
…
continue reading
GenerationAI is the groundbreaking podcast designed exclusively for higher education professionals who are keen to navigate the dynamic world of Artificial Intelligence. In a landscape where AI is rapidly transforming how we teach, learn, and engage, "GenerationAI" serves as your essential guide. Each episode delves into the most pressing AI topics, breaking down complex concepts into understandable, actionable insights. Whether you're a marketer, administrator, or tech enthusiast, this show ...
…
continue reading
1
The CopDoc Podcast: Aiming for Excellence in Leadership
Dr. Steve Morreale - Host - TheCopDoc Podcast
Visit our website: https://www.copdocpodcast.com The CopDoc Podcast delves into police leadership and innovation. The focus is on aiming for excellence in the delivery of police services across the globe. Dr. Steve Morreale is a retired law enforcement practitioner, a pracademic, turned academic, and scholar from Worcester State University. Steve is the Program Director for LIFTE, Command College - The Leadership Institute for Tomorrow's Executives at Liberty University. Steve shares ideas a ...
…
continue reading
Platypod is the official podcast of the Committee for the Anthropology of Science, Technology, and Computing. We talk about anthropology, STS, and all things tech. Tune in for conversations with researchers and experts on how technology is shaping our world. (Jingle by chimerical. CC BY-NC 4.0)
…
continue reading
An interview podcast about working in publishing, featuring professionals across a wide array of roles, in both junior and senior positions, from independent and corporate publishers, in academic and trade, as well as literary agents and founders of innovative start-ups. They tell us about their career path, the latest trends in the book industry, and they give advice for jobseekers and book recommendations for all of us book lovers. Cover portrait by Ellie Beadle. Website: www.publishing-in ...
…
continue reading
Welcome to Global Commerce Exchange. Our flagship podcast grew out of Professor Peter Maillet's conviction that global awareness and understanding are more important today than ever, especially during this time of urgent social, environmental, geopolitical, and technological challenges. These conversations are aimed at connecting our McIntire community of students, alumni, and friends to the global conversation on the future of business. Now, let’s get started. Join us for twice monthly conv ...
…
continue reading
We are in the midst of a Spiritual BOOM. This world is elevating, increasing in consciousness, upping in awareness. The Great Awakening, baby. Within this, there is so much that is untrue and deceptive… even spiritually. Eden Koz and her company Just Be® created this podcast to showcase truths in politics, history, God and source, the healing arts, conspiracy theories as well as the galaxy and beyond. This is about finding yourself, your sovereignty as well as your voice. Included in each se ...
…
continue reading
Welcome to the official free Podcast site from SAGE for Public Health. SAGE is a leading international publisher of journals, books, and electronic media for academic, educational, and professional markets with principal offices in Los Angeles, London, New Delhi, and Singapore.
…
continue reading
Curbsiders Teach is THE internal medicine podcast for all things medical education. We use expert interviews to inspire the next generation of medical educators by providing listeners with teaching pearls, practice-changing knowledge, and a learning objective-based dosing of Edutainment (medical education, made entertaining). Season 3 of this weekly mini-series will air every Tuesday starting April 4, 2023 on our website or wherever you get your podcasts! We are so excited to bring you this ...
…
continue reading
1
142 Gene Ang PhD~Arcturian Healing Method: Double Agent (Academic vs Esoteric), Warrior Initiation, ETs, Galactics, Tom Kenyon/Edgar Cayce, Mt Shasta & Tibet
1:00:17
1:00:17
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
1:00:17
Currently in CA, Gene Ang, PhD talks about his academic life via Stanford and Harvard to his spiritual expansion starting with the Course in Miracles as a sophomore. We then discuss the other esoteric studies that evolved him into what we termed on the show as "avant-garde" expanding the deep connection to galactic beings, moving through dimensions…
…
continue reading
This bonus content is a reading from Platypus, the CASTAC Blog. The full post by Savannah Mandel can be read at https://blog.castac.org/2024/10/do-academics-need-agents-part-1-of-publishing-in-academia/. About the post: If you hope to someday publish with a trade press such as Penguin, Harper Collins, or Simon & Schuster, or are interested in havin…
…
continue reading
1
"Woke Right": James Lindsay is Mental
23:24
23:24
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
23:24
Buy courses here: https://www.academic-agency.com/ Sub to my substack here: https://substack.com/profile/69785136-academic-agent Join the channel here: https://www.youtube.com/channel/UCyawG3aTE7RmNQcFQskDWcw/join Merch: https://aas-house-of-merchandise.creator-spring.com/ All my vital links: https://unpopular.academy/ ... https://www.youtube.com/w…
…
continue reading
1
JC Returns to Element451 + Higher Ed's AI Reality Check as 91% of CTOs Feel Unprepared
46:00
46:00
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
46:00
In this milestone episode, Dr. JC Bonilla returns home to Element451 as COO and Head of AI Practice, joining forces again with CEO Ardis Kadiu after their 17-year journey from NYU to now. The timing is critical - new data shows 91% of higher education CTOs feel unprepared for AI adoption. The longtime friends discuss how Element451's AI-first appro…
…
continue reading
This study analyzes the differences between full fine-tuning and LoRA in large language models, revealing distinct weight matrix structures and generalization behaviors despite similar performance on tasks. https://arxiv.org/abs//2410.21228 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: ht…
…
continue reading
1
LoRA vs Full Fine-tuning: An Illusion of Equivalence
13:44
13:44
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
13:44
This study analyzes the differences between full fine-tuning and LoRA in large language models, revealing distinct weight matrix structures and generalization behaviors despite similar performance on tasks. https://arxiv.org/abs//2410.21228 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: ht…
…
continue reading
Vision-Language Models show promise in reasoning across text and images but struggle with basic visual concepts, revealing significant gaps in their understanding and generalization abilities. https://arxiv.org/abs//2410.19546 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts…
…
continue reading
Vision-Language Models show promise in reasoning across text and images but struggle with basic visual concepts, revealing significant gaps in their understanding and generalization abilities. https://arxiv.org/abs//2410.19546 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts…
…
continue reading
This study investigates the training behavior and computational requirements of Small-scale Large Language Models (SLMs), focusing on hyperparameters and configurations to enhance efficiency and support low-resource AI research. https://arxiv.org/abs//2410.19456 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_pap…
…
continue reading
This study investigates the training behavior and computational requirements of Small-scale Large Language Models (SLMs), focusing on hyperparameters and configurations to enhance efficiency and support low-resource AI research. https://arxiv.org/abs//2410.19456 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_pap…
…
continue reading
1
Special Episode: Grassroots Warrior Women~Eden, Lisa, Carole & Michelle: Matriots/Grid Work/Fem Ener/Election/Inspire
1:11:25
1:11:25
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
1:11:25
Powerhouse Michelle Schiau hosts me and two other spiritual facilitators and fellow co-anchors of the Grassroots Warrior Network on Rumble: Lisa Schermerhorn and Carole Maureen Friesen. In this beautiful round table discussion, we talk about what is going on in our world spiritually as our evolution is blasting off more than ever. This will possibl…
…
continue reading
1
[QA] Physics-informed Neural Networks for Functional Differential Equations: Cylindrical Approximation and Its Convergence Guarantees
9:12
This paper introduces a hybrid approach combining physics-informed neural networks and cylindrical approximation to efficiently solve functional differential equations, addressing computational challenges and improving numerical analysis. https://arxiv.org/abs//2410.18153 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/…
…
continue reading
1
Physics-informed Neural Networks for Functional Differential Equations: Cylindrical Approximation and Its Convergence Guarantees
19:53
19:53
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
19:53
This paper introduces a hybrid approach combining physics-informed neural networks and cylindrical approximation to efficiently solve functional differential equations, addressing computational challenges and improving numerical analysis. https://arxiv.org/abs//2410.18153 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/…
…
continue reading
1
[QA] A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
8:04
This paper shows that integrating coherent reasoning in Few-shot Chain-of-Thought prompting enhances transformer performance, revealing sensitivity to errors in intermediate steps and proposing improvements using varied reasoning paths. https://arxiv.org/abs//2410.16540 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@a…
…
continue reading
1
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
18:20
18:20
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
18:20
This paper shows that integrating coherent reasoning in Few-shot Chain-of-Thought prompting enhances transformer performance, revealing sensitivity to errors in intermediate steps and proposing improvements using varied reasoning paths. https://arxiv.org/abs//2410.16540 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@a…
…
continue reading
LEGO is a novel technique for extracting and recombining small language models from large language models, enhancing efficiency, robustness, and user data privacy while reducing costs. https://arxiv.org/abs//2410.18287 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.c…
…
continue reading
1
LEGO: Language Model Building Blocks
16:46
16:46
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
16:46
LEGO is a novel technique for extracting and recombining small language models from large language models, enhancing efficiency, robustness, and user data privacy while reducing costs. https://arxiv.org/abs//2410.18287 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.c…
…
continue reading
1
[QA] Knowledge Distillation Using Frontier Open-Source LLMs: Generalizability and the Role of Synthetic Data
8:13
This study explores knowledge distillation from Llama-3.1-405B to smaller models, demonstrating improved accuracy and efficiency through synthetic data and diverse evaluation methods across various tasks. https://arxiv.org/abs//2410.18588 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: http…
…
continue reading
1
Knowledge Distillation Using Frontier Open-Source LLMs: Generalizability and the Role of Synthetic Data
19:45
19:45
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
19:45
This study explores knowledge distillation from Llama-3.1-405B to smaller models, demonstrating improved accuracy and efficiency through synthetic data and diverse evaluation methods across various tasks. https://arxiv.org/abs//2410.18588 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: http…
…
continue reading
1
[QA] Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers
8:09
This paper explores how Rotary Positional Embeddings (RoPE) affect Transformer model dynamics, introducing phase shifts that influence embeddings, information retention, and attention through oscillatory behaviors and frequency components. https://arxiv.org/abs//2410.18067 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com…
…
continue reading
1
Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers
20:50
20:50
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
20:50
This paper explores how Rotary Positional Embeddings (RoPE) affect Transformer model dynamics, introducing phase shifts that influence embeddings, information retention, and attention through oscillatory behaviors and frequency components. https://arxiv.org/abs//2410.18067 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com…
…
continue reading
ALTA is a new programming language and compiler that maps programs to Transformer weights, enabling loop expression and improved algorithm representation, while providing tools for analyzing training challenges. https://arxiv.org/abs//2410.18077 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcast…
…
continue reading
1
ALTA: Compiler-Based Analysis of Transformers
22:56
22:56
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
22:56
ALTA is a new programming language and compiler that maps programs to Transformer weights, enabling loop expression and improved algorithm representation, while providing tools for analyzing training challenges. https://arxiv.org/abs//2410.18077 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcast…
…
continue reading
This paper introduces UNSTAR, a novel unlearning method for large language models using anti-samples to efficiently and selectively reverse learned associations, enhancing privacy and model modification capabilities. https://arxiv.org/abs//2410.17050 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Po…
…
continue reading
1
UnStar: Unlearning with Self-Taught Anti-Sample Reasoning for LLMs
16:43
16:43
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
16:43
This paper introduces UNSTAR, a novel unlearning method for large language models using anti-samples to efficiently and selectively reverse learned associations, enhancing privacy and model modification capabilities. https://arxiv.org/abs//2410.17050 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Po…
…
continue reading
This paper explores how Knowledge Editing algorithms can unintentionally distort model representations, leading to decreased factual recall and reasoning abilities, a phenomenon termed "representation shattering." https://arxiv.org/abs//2410.17194 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
1
Representation Shattering in Transformers: A Synthetic Study with Knowledge Editing
18:06
18:06
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
18:06
This paper explores how Knowledge Editing algorithms can unintentionally distort model representations, leading to decreased factual recall and reasoning abilities, a phenomenon termed "representation shattering." https://arxiv.org/abs//2410.17194 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
1
Catching Rockets, Self-Driving Taxis, and Supercomputers: Elon Musk's Vision
57:19
57:19
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
57:19
In this eye-opening episode of Generation AI, we dive into Elon Musk's game-changing breakthroughs that are reshaping our future. From SpaceX's Starship catching rockets mid-air to Tesla's steering wheel-free robotaxis, and xAI's record-breaking supercomputer built in just 19 days, we explore how these innovations are pushing the boundaries of what…
…
continue reading
The paper proposes GenRM, a hybrid approach combining RLHF and RLAIF, improving synthetic preference labels' quality and outperforming existing models in both in-distribution and out-of-distribution tasks. https://arxiv.org/abs//2410.12832 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
1
Generative Reward Models
12:09
12:09
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
12:09
The paper proposes GenRM, a hybrid approach combining RLHF and RLAIF, improving synthetic preference labels' quality and outperforming existing models in both in-distribution and out-of-distribution tasks. https://arxiv.org/abs//2410.12832 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: htt…
…
continue reading
This bonus content is a reading from Platypus, the CASTAC Blog. The full post by Savannah Mandel can be read at https://blog.castac.org/2024/10/trade-versus-academic-press-part-2-of-publishing-in-academia/. About the post: The decision between the two publishers was not simple. It was financial. It was personal. It was intellectual. It was also ide…
…
continue reading
1
Episode 35: GETTING THE READER FROM BEGINNING TO END, WITH MERVE EMRE
59:20
59:20
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
59:20
A conversation with Shapiro-Silverberg Professor of Creative Writing and Criticism at Wesleyan University and contributing writer to The New Yorker magazine, Merve Emre. We talk about the work and goals of a book critic; what it means to think about the reader’s experience of our writing; creating a community of readers; and what it’s like to be ed…
…
continue reading
This paper presents an AI agent for error resolution in computational notebooks, enhancing bug-fixing capabilities while evaluating user experience and collaboration within the JetBrains Datalore service. https://arxiv.org/abs//2410.14393 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: http…
…
continue reading
1
Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks
10:28
10:28
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
10:28
This paper presents an AI agent for error resolution in computational notebooks, enhancing bug-fixing capabilities while evaluating user experience and collaboration within the JetBrains Datalore service. https://arxiv.org/abs//2410.14393 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: http…
…
continue reading
This study explores "dark matter" in sparse autoencoders, revealing that much unexplained variance can be predicted and proposing methods to reduce nonlinear error in model activations. https://arxiv.org/abs//2410.14670 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.…
…
continue reading
1
Decomposing The Dark Matter of Sparse Autoencoders
15:36
15:36
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
15:36
This study explores "dark matter" in sparse autoencoders, revealing that much unexplained variance can be predicted and proposing methods to reduce nonlinear error in model activations. https://arxiv.org/abs//2410.14670 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.…
…
continue reading
1
Chief Jason Armstrong's Vision for Modern Law Enforcement
54:22
54:22
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
54:22
Hey there! Send us a message. Who else should we be talking to? What topics are important? Use FanMail to connect! Let us know! The CopDoc Podcast Season 6 - Episode 140 Join us on a journey as we chat with Chief Jason Armstrong from the Apex Police Department in North Carolina, a leader who has been reshaping the face of law enforcement. Jason's p…
…
continue reading
1
[QA] A Hitchhiker's Guide to Scaling Law Estimation
10:08
10:08
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
10:08
The paper analyzes scaling laws in machine learning, providing best practices for estimating model performance using a large dataset of pretrained models and emphasizing the importance of intermediate training checkpoints. https://arxiv.org/abs//2410.11840 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Ap…
…
continue reading
1
A Hitchhiker's Guide to Scaling Law Estimation
17:28
17:28
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
17:28
The paper analyzes scaling laws in machine learning, providing best practices for estimating model performance using a large dataset of pretrained models and emphasizing the importance of intermediate training checkpoints. https://arxiv.org/abs//2410.11840 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Ap…
…
continue reading
This paper presents a novel method for image inversion and editing using rectified flow models, achieving superior performance in zero-shot tasks compared to existing diffusion model approaches. https://arxiv.org/abs//2410.10792 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcas…
…
continue reading
1
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations
10:29
10:29
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
10:29
This paper presents a novel method for image inversion and editing using rectified flow models, achieving superior performance in zero-shot tasks compared to existing diffusion model approaches. https://arxiv.org/abs//2410.10792 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcas…
…
continue reading
The paper explores whether large language models (LLMs) can introspect, finding that finetuned models can predict their own behavior, suggesting a form of internal knowledge access. https://arxiv.org/abs//2410.13787 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/…
…
continue reading
1
Looking Inward: Language Models Can Learn About Themselves by Introspection
26:21
26:21
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
26:21
The paper explores whether large language models (LLMs) can introspect, finding that finetuned models can predict their own behavior, suggesting a form of internal knowledge access. https://arxiv.org/abs//2410.13787 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/…
…
continue reading
The paper proposes a method to enhance LLMs' thinking abilities for better instruction following, improving performance across various tasks without additional human data through iterative search and optimization. https://arxiv.org/abs//2410.10630 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
1
Thinking LLMs: General Instruction Following with Thought Generation
15:52
15:52
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
15:52
The paper proposes a method to enhance LLMs' thinking abilities for better instruction following, improving performance across various tasks without additional human data through iterative search and optimization. https://arxiv.org/abs//2410.10630 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podca…
…
continue reading
1
[QA] Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
7:44
The paper investigates extreme-token phenomena in transformer-based LLMs, revealing mechanisms behind attention sinks and proposing strategies to mitigate their impact during pretraining. https://arxiv.org/abs//2410.13835 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.appl…
…
continue reading
1
Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs
17:44
17:44
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
17:44
The paper investigates extreme-token phenomena in transformer-based LLMs, revealing mechanisms behind attention sinks and proposing strategies to mitigate their impact during pretraining. https://arxiv.org/abs//2410.13835 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.appl…
…
continue reading
https://arxiv.org/abs//2410.13720 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
1
MOVIE GEN: A Cast of Media Foundation Models
1:53:06
1:53:06
Später Spielen
Später Spielen
Listen
Gefällt mir
Geliked
1:53:06
https://arxiv.org/abs//2410.13720 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading
https://arxiv.org/abs//2410.12557 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers --- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/supp…
…
continue reading