/r/MachineLearning - top ten submissions for each month of 2024
2024, September
411 [D] I feel like ever since LLM APIs have become...
288 [P]: TensorHue – a tensor visualization library...
242 [R] Training models with multiple losses
219 [D] How do researchers in hot topics keep up?
195 [D] OpenAI new reasoning model called o1
175 Built gpt2 in C [P]
124 [R] What are the Top 3 most exciting research d...
116 [P] Converting GPT to Llama step-by-step code g...
107 [D] Why is CUDA so much faster than ROCm?
105 [P] Achieved over 100 million MNIST predictions...
2024, August
301 [D] LLMs aren't interesting, anyone else?
287 [P] Illustrated book to learn about Transformer...
213 [R] Playable 20FPS Doom via a finetuned SD1.4 m...
211 [D] what is the hardest thing as a machine lear...
184 [D] Is the new norm for NLP papers "prompt engi...
171 [R] I got my first publication!
161 [R] Waving Goodbye to Low-Res: A Diffusion-Wave...
157 [D] What industry has the worst data?
146 [P] Updates on OpenCL backend for Pytorch
143 [R] What’s Really Going On in Machine Learning?...
2024, July
287 [P] ChessGPT, 100,000x smaller than GPT-4, play...
251 [D] What's the endgame for AI labs that are spe...
243 [N] Llama 3.1 405B launches
192 [P] I was struggling with how Stable Diffusion works, ...
154 [D] What are issues in AI/ML that no one seems ...
150 [D] Rare skills of exceptional ML Engineers
129 [D] Ideas on how to create a hierarchical LLM w...
98 [R] Protein language models expose viral mimic...
95 [N] Yoshua Bengio's latest letter addressing ar...
84 [D] Scientific Machine Learning
2024, June
253 [N] Ilya Sutskever and friends launch Safe Supe...
206 [P] mamba.np: pure NumPy implementation of Mamba
202 [D] Coworkers recently told me that the people ...
178 [D] Feeling Lost in My ML Career: Advice Needed
174 [D] "Grok" means way too many different things
166 [R] Are you a reviewer for NeurIPS'24? Please r...
155 [D] Is anyone else absolutely besieged by paper...
153 [D] ML Researchers in Industry: How Do You Find...
150 [D] Discussing Apple's Deployment of a 3 Billio...
142 [D]LLM interview Q&A
2024, May
1250 [D] The "it" in AI models is really just the da...
433 [N] AI engineers report burnout and rushed roll...
385 [D] How did OpenAI go from doing exciting resea...
374 [R] KAN: Kolmogorov-Arnold Networks
322 [D] AI Agents: too early, too expensive, too un...
315 [D] Kolmogorov-Arnold Network is just an MLP
259 [P] I reproduced Anthropic's recent interpretab...
233 [D] What's up with papers without code?
232 [R] Our new classification algorithm outperform...
230 [D] Why do juniors (undergraduates or first- to...
2024, April
955 Meta does everything OpenAI should be [D]
830 [D] LLMs are harming AI research
692 [D] Llama-3 may have just killed proprietary AI...
600 [D] Folks here have no idea how competitive top...
448 Stanford releases their rather comprehensive (5...
434 [D] LLMs causing more harm than good for the fi...
400 [N] Meta releases Llama 3
353 You need everything other than ML to win a ML h...
295 [D] ML researchers who are not in NLP, what are...
266 [D] What are your horror stories from being tas...
2024, March
757 [D] When your use of AI for summary didn't come...
608 WSJ: The AI industry spent 17x more on Nvidia c...
557 [D] Your salary is determined mainly by geograp...
514 [N] Matrix multiplication breakthrough could le...
507 [D] Feeling burnt out after doing machine learn...
473 [P] How I found 8 bugs in Google's Gemma 6T tok...
444 [R] Analysis of 300+ ML competitions in 2023
394 [D] What are some well-written ML codebases to ...
381 [N] How Stability AI’s Founder Tanked His Billi...
351 [R] Up to 17% of Recent AI Conference Peer Revi...
2024, February
950 [D] Off my chest. I'm doing PhD in ML, and I'm ...
629 [D] Is the tech industry still not recovered or...
473 [R] The Era of 1-bit LLMs: All Large Language M...
392 [D] OpenAI Sora Video Gen -- How??
381 [P] Chess-GPT, 1000x smaller than GPT-4, plays ...
324 [D] Does anyone else feel like there's an entir...
295 The industry is not going "recover" for newly m...
296 [News] Google release new and open llm model: g...
290 [N] Gemini 1.5, MoE with 1M tokens of context-l...
272 [D] Why do researchers so rarely release traini...
2024, January
562 [R] Google DeepMind Diagnostic LLM Exceeds Huma...
459 What do you think about Yann Lecun's controvers...
393 Most things we have today in AI will be a irrel...
393 [D] Scikit-Learn fixed its F-1 score calculator...
369 [D] How does our brain prevent overfitting?
354 [D] Data scientists who made a passive income, ...
325 [D] What is your honest experience with reinfor...
302 [D] So, Mamba vs. Transformers... is the hype r...
295 [D] 3 years doing ML, no success yet. Is it com...
279 [D] How do you deal with unreasonable request f...