/r/MachineLearning - top ten submissions for each month of 2024

2024, December

828 [D]Stuck in AI Hell: What to do in post LLM world

700 [D] The winner of the NeurIPS 2024 Best Paper A...

689 [D] Can we please stop using "is all we need" i...

632 [D] What happened at NeurIPS?

549 [P] I made wut – a CLI that explains your last ...

324 [N] Sama, an AI sweatshop, pays workers in Keny...

312 [P] I made Termite – a CLI that can generate te...

277 [D] OpenAI o3 87.5% High Score on ARC Prize Cha...

254 [D] - Why MAMBA did not catch on?

213 [D] i sensed anxiety and frustration at NeurIPS...

2024, November

435 [R] Must-Read ML Theory Papers

422 [P] Analysis of why UMAP is so fast

284 [P] I made a library for building agents that u...

277 [D] Accepted NeurIPS 2024 paper claimed to be s...

261 [D] What’s the most surprising or counterintuit...

231 [D] Theory behind modern diffusion models

226 [D] Is there an alternative to Science Twitter/X?

195 [D] What’s a machine learning paper or research...

194 [D] Why ML PhD is so competitive?

172 [D] AMA: I’m Head of AI at a firm in the UK, ad...

2024, October

1158 [N] 2024 Nobel Prize for Physics goes to ML and...

1087 [D] Why do PhD Students in the US seem like ove...

420 [N] The 2024 Nobel Prize in Chemistry goes to t...

349 [P] Drowning in Research Papers? ?

351 [N] Jurgen Schmidhuber on 2024 Physics Nobel Prize

326 [D] Transformers are a type of CNN

308 [D] PyTorch 2.5.0 released!

300 [P] Just-in-Time Implementation: A Python Libra...

291 [D] Is it common for ML researchers to tweak co...

249 [R] Were RNNs All We Needed?

2024, September

411 [D] I feel like ever since LLM APIs have become...

288 [P]: TensorHue – a tensor visualization library...

242 [R] Training models with multiple losses

219 [D] How do researchers in hot topics keep up?

195 [D] OpenAI new reasoning model called o1

175 Built gpt2 in C [P]

124 [R] What are the Top 3 most exciting research d...

116 [P] Converting GPT to Llama step-by-step code g...

107 [D] Why is CUDA so much faster than ROCm?

105 [P] Achieved over 100 million MNIST predictions...

2024, August

301 [D] LLMs aren't interesting, anyone else?

287 [P] Illustrated book to learn about Transformer...

213 [R] Playable 20FPS Doom via a finetuned SD1.4 m...

211 [D] what is the hardest thing as a machine lear...

184 [D] Is the new norm for NLP papers "prompt engi...

171 [R] I got my first publication!

161 [R] Waving Goodbye to Low-Res: A Diffusion-Wave...

157 [D] What industry has the worst data?

146 [P] Updates on OpenCL backend for Pytorch

143 [R] What’s Really Going On in Machine Learning?...

2024, July

287 [P] ChessGPT, 100,000x smaller than GPT-4, play...

251 [D] What's the endgame for AI labs that are spe...

243 [N] Llama 3.1 405B launches

192 [P] I was struggle how Stable Diffusion works, ...

154 [D] What are issues in AI/ML that no one seems ...

150 [D] Rare skills of execptional ML Engineers

129 [D] Ideas on how to create a hierarchical LLM w...

98 [R] Protein language models expose viral mimic...

95 [N] Yoshua Bengio's latest letter addressing ar...

84 [D] Scientific Machine Learning

2024, June

253 [N] Ilya Sutskever and friends launch Safe Supe...

206 [P] mamba.np: pure NumPy implementation of Mamba

202 [D] Coworkers recently told me that the people ...

178 [D] Feeling Lost in My ML Career: Advice Needed

174 [D] "Grok" means way too many different things

166 [R] Are you a reviewer for NeurIPS'24? Please r...

155 [D] Is anyone else absolutely besieged by paper...

153 [D] ML Researchers in Industry: How Do You Find...

150 [D] Discussing Apple's Deployment of a 3 Billio...

142 [D]LLM interview Q&A

2024, May

1250 [D] The "it" in AI models is really just the da...

433 [N] AI engineers report burnout and rushed roll...

385 [D] How did OpenAI go from doing exciting resea...

374 [R] KAN: Kolmogorov-Arnold Networks

322 [D] AI Agents: too early, too expensive, too un...

315 [D] Kolmogorov-Arnold Network is just an MLP

259 [P] I reproduced Anthropic's recent interpretab...

233 [D] What's up with papers without code?

232 [R] Our new classification algorithm outperform...

230 [D] Why do juniors (undergraduates or first- to...

2024, April

955 Meta does everything OpenAI should be [D]

830 [D] LLMs are harming AI research

692 [D] Llama-3 may have just killed proprietary AI...

600 [D] Folks here have no idea how competitive top...

448 Stanford releases their rather comprehensive (5...

434 [D] LLMs causing more harm than good for the fi...

400 [N] Meta releases Llama 3

353 You need everything other than ML to win a ML h...

295 [D] ML researchers who are not in NLP, what are...

266 [D] What are your horror stories from being tas...

2024, March

757 [D] When your use of AI for summary didn't come...

608 WSJ: The AI industry spent 17x more on Nvidia c...

557 [D] Your salary is determined mainly by geograp...

514 [N] Matrix multiplication breakthrough could le...

507 [D] Feeling burnt out after doing machine learn...

473 [P] How I found 8 bugs in Google's Gemma 6T tok...

444 [R] Analysis of 300+ ML competitions in 2023

394 [D] What are some well-written ML codebases to ...

381 [N] How Stability AI’s Founder Tanked His Billi...

351 [R] Up to 17% of Recent AI Conference Peer Revi...

2024, February

950 [D] Off my chest. I'm doing PhD in ML, and I'm ...

629 [D] Is the tech industry still not recovered or...

473 [R] The Era of 1-bit LLMs: All Large Language M...

392 [D] OpenAI Sora Video Gen -- How??

381 [P] Chess-GPT, 1000x smaller than GPT-4, plays ...

324 [D] Does anyone else feel like there's an entir...

295 The industry is not going "recover" for newly m...

296 [News] Google release new and open llm model: g...

290 [N] Gemini 1.5, MoE with 1M tokens of context-l...

272 [D] Why do researchers so rarely release traini...

2024, January

562 [R] Google DeepMind Diagnostic LLM Exceeds Huma...

459 What do you think about Yann Lecun's controvers...

393 Most things we have today in AI will be a irrel...

393 [D] Scikit-Learn fixed its F-1 score calculator...

369 [D] How does our brain prevent overfitting?

354 [D] Data scientists who made a passive income, ...

325 [D] What is your honest experience with reinfor...

302 [D] So, Mamba vs. Transformers... is the hype r...

295 [D] 3 years doing ML, no success yet. Is it com...

279 [D] How do you deal with unreasonable request f...