/r/MachineLearning - top ten submissions for each month of 2024

sfw subreddits | << MachineLearning 2023
2024, September
[D] I feel like ever since LLM APIs have become...
411 [D] I feel like ever since LLM APIs have become...
[P]: TensorHue – a tensor visualization library...
288 [P]: TensorHue – a tensor visualization library...
[R] Training models with multiple losses
242 [R] Training models with multiple losses
[D] How do researchers in hot topics keep up?
219 [D] How do researchers in hot topics keep up?
[D] OpenAI new reasoning model called o1
195 [D] OpenAI new reasoning model called o1
Built gpt2 in C [P]
175 Built gpt2 in C [P]
[R] What are the Top 3 most exciting research d...
124 [R] What are the Top 3 most exciting research d...
[P] Converting GPT to Llama step-by-step code g...
116 [P] Converting GPT to Llama step-by-step code g...
[D] Why is CUDA so much faster than ROCm?
107 [D] Why is CUDA so much faster than ROCm?
[P] Achieved over 100 million MNIST predictions...
105 [P] Achieved over 100 million MNIST predictions...
2024, August
[D] LLMs aren't interesting, anyone else?
301 [D] LLMs aren't interesting, anyone else?
[P] Illustrated book to learn about Transformer...
287 [P] Illustrated book to learn about Transformer...
[R] Playable 20FPS Doom via a finetuned SD1.4 m...
213 [R] Playable 20FPS Doom via a finetuned SD1.4 m...
[D] what is the hardest thing as a machine lear...
211 [D] what is the hardest thing as a machine lear...
[D] Is the new norm for NLP papers "prompt engi...
184 [D] Is the new norm for NLP papers "prompt engi...
[R] I got my first publication!
171 [R] I got my first publication!
[R] Waving Goodbye to Low-Res: A Diffusion-Wave...
161 [R] Waving Goodbye to Low-Res: A Diffusion-Wave...
[D] What industry has the worst data?
157 [D] What industry has the worst data?
[P] Updates on OpenCL backend for Pytorch
146 [P] Updates on OpenCL backend for Pytorch
[R] What’s Really Going On in Machine Learning?...
143 [R] What’s Really Going On in Machine Learning?...
2024, July
[P] ChessGPT, 100,000x smaller than GPT-4, play...
287 [P] ChessGPT, 100,000x smaller than GPT-4, play...
[D] What's the endgame for AI labs that are spe...
251 [D] What's the endgame for AI labs that are spe...
[N] Llama 3.1 405B launches
243 [N] Llama 3.1 405B launches
[P] I was struggle how Stable Diffusion works, ...
192 [P] I was struggle how Stable Diffusion works, ...
[D] What are issues in AI/ML that no one seems ...
154 [D] What are issues in AI/ML that no one seems ...
[D] Rare skills of execptional ML Engineers
150 [D] Rare skills of execptional ML Engineers
[D] Ideas on how to create a hierarchical LLM w...
129 [D] Ideas on how to create a hierarchical LLM w...
[R]  Protein language models expose viral mimic...
98 [R] Protein language models expose viral mimic...
[N] Yoshua Bengio's latest letter addressing ar...
95 [N] Yoshua Bengio's latest letter addressing ar...
[D] Scientific Machine Learning
84 [D] Scientific Machine Learning
2024, June
[N] Ilya Sutskever and friends launch Safe Supe...
253 [N] Ilya Sutskever and friends launch Safe Supe...
[P] mamba.np: pure NumPy implementation of Mamba
206 [P] mamba.np: pure NumPy implementation of Mamba
[D] Coworkers recently told me that the people ...
202 [D] Coworkers recently told me that the people ...
[D] Feeling Lost in My ML Career: Advice Needed
178 [D] Feeling Lost in My ML Career: Advice Needed
[D] "Grok" means way too many different things
174 [D] "Grok" means way too many different things
[R] Are you a reviewer for NeurIPS'24? Please r...
166 [R] Are you a reviewer for NeurIPS'24? Please r...
[D] Is anyone else absolutely besieged by paper...
155 [D] Is anyone else absolutely besieged by paper...
[D] ML Researchers in Industry: How Do You Find...
153 [D] ML Researchers in Industry: How Do You Find...
[D] Discussing Apple's Deployment of a 3 Billio...
150 [D] Discussing Apple's Deployment of a 3 Billio...
[D]LLM interview Q&amp;A
142 [D]LLM interview Q&amp;A
2024, May
[D] The "it" in AI models is really just the da...
1250 [D] The "it" in AI models is really just the da...
[N] AI engineers report burnout and rushed roll...
433 [N] AI engineers report burnout and rushed roll...
[D] How did OpenAI go from doing exciting resea...
385 [D] How did OpenAI go from doing exciting resea...
[R] KAN: Kolmogorov-Arnold Networks
374 [R] KAN: Kolmogorov-Arnold Networks
[D] AI Agents: too early, too expensive, too un...
322 [D] AI Agents: too early, too expensive, too un...
[D] Kolmogorov-Arnold Network is just an MLP
315 [D] Kolmogorov-Arnold Network is just an MLP
[P] I reproduced Anthropic's recent interpretab...
259 [P] I reproduced Anthropic's recent interpretab...
[D] What's up with papers without code?
233 [D] What's up with papers without code?
[R] Our new classification algorithm outperform...
232 [R] Our new classification algorithm outperform...
[D] Why do juniors (undergraduates or first- to...
230 [D] Why do juniors (undergraduates or first- to...
2024, April
Meta does everything OpenAI should be [D]
955 Meta does everything OpenAI should be [D]
[D] LLMs are harming AI research
830 [D] LLMs are harming AI research
[D] Llama-3 may have just killed proprietary AI...
692 [D] Llama-3 may have just killed proprietary AI...
[D] Folks here have no idea how competitive top...
600 [D] Folks here have no idea how competitive top...
Stanford releases their rather comprehensive (5...
448 Stanford releases their rather comprehensive (5...
[D] LLMs causing more harm than good for the fi...
434 [D] LLMs causing more harm than good for the fi...
[N] Meta releases Llama 3
400 [N] Meta releases Llama 3
You need everything other than ML to win a ML h...
353 You need everything other than ML to win a ML h...
[D] ML researchers who are not in NLP, what are...
295 [D] ML researchers who are not in NLP, what are...
[D] What are your horror stories from being tas...
266 [D] What are your horror stories from being tas...
2024, March
[D] When your use of AI for summary didn't come...
757 [D] When your use of AI for summary didn't come...
WSJ: The AI industry spent 17x more on Nvidia c...
608 WSJ: The AI industry spent 17x more on Nvidia c...
[D] Your salary is determined mainly by geograp...
557 [D] Your salary is determined mainly by geograp...
[N] Matrix multiplication breakthrough could le...
514 [N] Matrix multiplication breakthrough could le...
[D] Feeling burnt out after doing machine learn...
507 [D] Feeling burnt out after doing machine learn...
[P] How I found 8 bugs in Google's Gemma 6T tok...
473 [P] How I found 8 bugs in Google's Gemma 6T tok...
[R] Analysis of 300+ ML competitions in 2023
444 [R] Analysis of 300+ ML competitions in 2023
[D] What are some well-written ML codebases to ...
394 [D] What are some well-written ML codebases to ...
[N] How Stability AI’s Founder Tanked His Billi...
381 [N] How Stability AI’s Founder Tanked His Billi...
[R] Up to 17% of Recent AI Conference Peer Revi...
351 [R] Up to 17% of Recent AI Conference Peer Revi...
2024, February
[D] Off my chest. I'm doing PhD in ML, and I'm ...
950 [D] Off my chest. I'm doing PhD in ML, and I'm ...
[D] Is the tech industry still not recovered or...
629 [D] Is the tech industry still not recovered or...
[R] The Era of 1-bit LLMs: All Large Language M...
473 [R] The Era of 1-bit LLMs: All Large Language M...
[D] OpenAI Sora Video Gen -- How??
392 [D] OpenAI Sora Video Gen -- How??
[P] Chess-GPT, 1000x smaller than GPT-4, plays ...
381 [P] Chess-GPT, 1000x smaller than GPT-4, plays ...
[D] Does anyone else feel like there's an entir...
324 [D] Does anyone else feel like there's an entir...
The industry is not going "recover" for newly m...
295 The industry is not going "recover" for newly m...
[News] Google release new and open llm model: g...
296 [News] Google release new and open llm model: g...
[N] Gemini 1.5, MoE with 1M tokens of context-l...
290 [N] Gemini 1.5, MoE with 1M tokens of context-l...
[D] Why do researchers so rarely release traini...
272 [D] Why do researchers so rarely release traini...
2024, January
[R] Google DeepMind Diagnostic LLM Exceeds Huma...
562 [R] Google DeepMind Diagnostic LLM Exceeds Huma...
What do you think about Yann Lecun's controvers...
459 What do you think about Yann Lecun's controvers...
Most things we have today in AI will be a irrel...
393 Most things we have today in AI will be a irrel...
[D] Scikit-Learn fixed its F-1 score calculator...
393 [D] Scikit-Learn fixed its F-1 score calculator...
[D] How does our brain prevent overfitting?
369 [D] How does our brain prevent overfitting?
[D] Data scientists who made a passive income, ...
354 [D] Data scientists who made a passive income, ...
[D] What is your honest experience with reinfor...
325 [D] What is your honest experience with reinfor...
[D] So, Mamba vs. Transformers... is the hype r...
302 [D] So, Mamba vs. Transformers... is the hype r...
[D] 3 years doing ML, no success yet. Is it com...
295 [D] 3 years doing ML, no success yet. Is it com...
[D] How do you deal with unreasonable request f...
279 [D] How do you deal with unreasonable request f...