helloericsf's submissions

1.		Context Engineering for AI Agents: Lessons (manus.im)
		120 points by helloericsf 3 months ago \| past \| 4 comments
2.		Context Engineering for AI Agents: Lessons (manus.im)
		3 points by helloericsf 5 months ago \| past
3.		Better than DeepSeek R1? MiniMax-M1: open-weight hybrid-attention reasoning model (huggingface.co)
		6 points by helloericsf 6 months ago \| past
4.		kit - Code Intelligence Toolkit (github.com/cased)
		1 point by helloericsf 7 months ago \| past
5.		DeepSeek Open Source Optimized Parallelism Strategies, 3 repos (github.com/deepseek-ai)
		103 points by helloericsf 10 months ago \| past \| 8 comments
6.		DeepSeek Open Source DeepGEMM – FP8 GEMM Library (300 lines for 1350+ FP8 TFLOPS) (twitter.com/deepseek_ai)
		4 points by helloericsf 10 months ago \| past \| 1 comment
7.		Alibaba Open Source Large-Scale Video Generative Models: Wan2.1 (twitter.com/_akhaliq)
		8 points by helloericsf 10 months ago \| past \| 2 comments
8.		DeepSeek open source DeepEP – library for MoE training and inference (github.com/deepseek-ai)
		536 points by helloericsf 10 months ago \| past \| 71 comments
9.		DeepSeek Open Source FlashMLA – MLA Decoding Kernel for Hopper GPUs (github.com/deepseek-ai)
		441 points by helloericsf 10 months ago \| past \| 108 comments
10.		New Qwen2.5-Max Outperforms DeepSeek V3 in Benchmarks (twitter.com/justinlin610)
		3 points by helloericsf 11 months ago \| past \| 2 comments
11.		Longest context up to 4M, MiniMax-01 hybrid 456B Open source model (github.com/minimax-ai)
		19 points by helloericsf 11 months ago \| past \| 1 comment
12.		DeepSeek v3 beats Claude Sonnet 3.5 and way cheaper (huggingface.co)
		48 points by helloericsf on Dec 26, 2024 \| past \| 9 comments
13.		NeurIPS and Dr. Picard released statement for singling out Chinese scholars (twitter.com/neuripsconf)
		2 points by helloericsf on Dec 16, 2024 \| past \| 2 comments
14.		Tencent Hunyuan-Large (github.com/tencent)
		148 points by helloericsf on Nov 5, 2024 \| past \| 103 comments
15.		Chinese AI Community: open-source Heatmap (huggingface.co)
		1 point by helloericsf on July 31, 2024 \| past \| 1 comment
16.		Poolside is raising $400M+ at a $2B valuation to build a coding co-pilot (techcrunch.com)
		3 points by helloericsf on June 20, 2024 \| past \| 1 comment
17.		Is LMDeploy the Ultimate Solution? Why It Outshines VLLM, TRT-LLM, TGI, and MLC (bentoml.com)
		16 points by helloericsf on June 20, 2024 \| past \| 8 comments
18.		21.2× faster than llama.cpp? plus 40% memory usage reduction (arxiv.org)
		43 points by helloericsf on June 12, 2024 \| past \| 14 comments
19.		Databricks acquires Tabular, Snowflake fork Iceberg? (datagravity.dev)
		2 points by helloericsf on June 4, 2024 \| past \| 3 comments
20.		New Yi 1.5 models under Apache 2.0 (huggingface.co)
		2 points by helloericsf on May 12, 2024 \| past
21.		Snowflake Arctic (snowflake.com)
		1 point by helloericsf on April 25, 2024 \| past \| 1 comment
22.		Training-Free Long-Context Scaling of Large Language Models (arxiv.org)
		2 points by helloericsf on April 23, 2024 \| past
23.		Multi-agent collaboration design patterns (deeplearning.ai)
		2 points by helloericsf on April 18, 2024 \| past \| 1 comment
24.		Yi-34B, Llama 2, and common practices in LLM training (eleuther.ai)
		41 points by helloericsf on April 4, 2024 \| past \| 3 comments
25.		Cloudflare Calls – Build real-time serverless video, audio and data applications (cloudflare.com)
		8 points by helloericsf on April 4, 2024 \| past \| 1 comment