korea

High signal Matched: generation, moe, performance, model, weights, paper, research, evaluation, korea, korean, seoul, naver, training, fine-tuning, quantization, agent, agents, agentic

Nota AI · korea · 2026-04-22

[Deep Dive: NetsPresso®] From Quantization to Graph Optimization: A Step-by-Step Model Deployment Pipeline

Score 54

  Jaehoon Lee Technical Content Manager, Nota AI   Series Notice: NetsPresso® Technical Blog, Part 2In Part 1, we walked through a scenario of deploying Llama 3.2 1B on an edge device to illustrate the NetsPresso® workflow. The f...

inference kernel cuda benchmark hardware model-release research korea training quantization evals api open-source

High signal Matched: inference, kernel, cuda, matmul, benchmark, performance, latency, cost, npu, model, weights, paper, research, evaluation, furiosa, training, quantization, int8, int4, awq, gptq, sdk, open-source

SqueezeBits · korea · 2026-04-14

Recap: 2nd vLLM Korea Meetup 2026

Score 12

Check out highlights from the 2nd vLLM Korea Meetup! open-source use cases and real-world production examples that showcase vLLM's technical maturity!

korea open-source

High signal Matched: korea, open-source

vLLM Project · open-source · 2026-04-14

vLLM Korea Meetup 2026 Wrap-Up

Score 16

Hosted by the vLLM KR Community, with support from Rebellions, SqueezeBits, Red Hat APAC, and PyTorch Korea, the vLLM Korea Meetup 2026 was held in Seoul on April 2nd.

High signal Matched: korea, seoul, rebellions

Rebellions · hardware · 2026-04-13

2026 vLLM Korea Meetup

Score 14

vLLM KR 커뮤니티가 주관하고, 리벨리온(Rebellions), SqueezeBits, Red Hat APAC, PyTorch Korea가 함께한 vLLM Korea Meetup 2026이 4월 2일 서울에서 열렸습니다.... The post 2026 vLLM Korea Meetup appeared first on Rebellions.

High signal Matched: korea, rebellions

Rebellions · hardware · 2026-04-02

NPU 서버 기반 피지컬 AI, 아랍에미리트(UAE) 수질 정화 로봇 솔루션

Score 14

Summary Challenge 석유 및 가스 산업이 발달한 중동 지역에서는 원유 생산 과정에서 불가피하게 발생하는 폐수와 기름을 처리해야 합니다. 특히, 저수지와... The post NPU 서버 기반 피지컬 AI, 아랍에미리트(UAE) 수질 정화 로봇 솔루션 appeared first on Rebellions.

inference kv-cache moe benchmark model-release research korea quantization

High signal Matched: npu, rebellions

Nota AI · korea · 2026-03-20

GenAI Everywhere: The Future of Edge AI Optimization with the New NetsPresso®

Score 26

  NP Product Team, Nota AI   The role of Edge AI is rapidly expanding.Offline voice assistants now carry on conversations in our daily lives, vehicles infer routes in real time, and smartphones generate images without a network c...

inference serving moe benchmark hardware model-release research korea training quantization evals long-context open-source

High signal Matched: inference, kv cache, moe, benchmark, performance, latency, cost, model, research, seoul, quantization

Nota AI · korea · 2026-03-13

NotaMoEQuantization: An MoE-Specific Quantization Method for Solar-Open-100B

Score 62

  Hancheol Park, Ph. D. AI Research Engineer, Nota AI Tairen PiaoAI Research Engineer, Nota AI Tae-Ho KimCTO & Co-Founder, Nota AI ✔️ Resource : The official quantized model of Solar-Open-100B, which passed the first round of Sout...

High signal Matched: inference, serving, prefill, generation, throughput, moe, router, benchmark, performance, latency, ttft, tpot, blackwell, release, model, weights, open model, research, evaluation, korea, korean, upstage, training, post-training, quantization, quantized, int4, evaluate, benchmarks, mmlu, long-context

Rebellions · hardware · 2025-12-29

LLM/RAG 기반 몽골 관세청 물품 분류 코드 AI 추천 챗봇

Score 10

Summary Challenge 관세청은 매년 방대한 양의 수출입 신고서를 처리하며, 각 품목에 적합한 HS 코드(Harmonized System Code)를 정확하게 분류해야 하는 업무를... The post LLM/RAG 기반 몽골 관세청 물품 분류 코드 AI 추천 챗봇 appeared first on Rebellions.

korea rag

inference serving benchmark hardware model-release korea

High signal Matched: rebellions, rag

SqueezeBits · korea · 2025-12-24

Introducing rebellions ATOM™-MAX

Score 24

Introducing ATOM™-Max, rebellions’ next-generation NPU designed for high-performance AI inference. Learn how its runtime, profiling tools, and PyTorch-native integrations enable developers to run and serve models efficiently without sacrif...

High signal Matched: inference, generation, serve, performance, npu, introducing, rebellions

SqueezeBits · korea · 2025-12-10

vLLM Hands-on Workshop with Rebellions & SqueezeBits: A Recap

Score 12

Rebellions and SqueezeBits Co-Host a vLLM Hands-on Workshop: Workshop Highlights, PyTorch Best Practices, Performance Optimization, and Developer First-Hand Tips!

benchmark korea

High signal Matched: performance, rebellions

Rebellions · hardware · 2025-11-20

NPU로 구동되는 AI 기반 동물 영상 진단 보조 서비스

Score 14

Summary Challenge 최근 반려동물 양육 인구의 증가로 X-ray 영상 진단 수요가 빠르게 확대되고 있습니다. 그러나 국내 영상의학 전공 수의사는 수백... The post NPU로 구동되는 AI 기반 동물 영상 진단 보조 서비스 appeared first on Rebellions.

High signal Matched: npu, rebellions

Rebellions · hardware · 2025-11-07

vLLM Hands-on Workshop WrapUp

Score 14

리벨리온 NPU에서 직접 경험한 LLM 추론의 새로운 가능성 지난 8월 vLLM Korea Meetup에 이어, 10월 29일 리벨리온과 스퀴즈비츠 주관으로 vLLM... The post vLLM Hands-on Workshop WrapUp appeared first on Rebellions.

High signal Matched: npu, korea, rebellions

Rebellions · hardware · 2025-10-20

지속 가능한 AI 확장을 위하여: 데이터센터 연산과 전력 공급의 혁신

Score 10

Summary Challenge 초대형 AI 시설은 이미 소도시 규모의 전력을 소비하고 있습니다. 단일 사이트의 수요가 100~200MW에 달해 소형 원자로급 수준입니다. AI... The post 지속 가능한 AI 확장을 위하여: 데이터센터 연산과 전력 공급의 혁신 appeared first on Rebellions.

High signal Matched: rebellions

Rebellions · hardware · 2025-09-17

The First vLLM Meetup in Korea

Score 14

리벨리온(Rebellions)과 레드햇(Rad Hat)이 주최하고 파이토치 코리아와 스퀴즈비츠(SqueezeBits)가 함께 기획한 제1회 vLLM 커뮤니티 밋업 코리아 행사가 2025년 8월 19일 서울에서 열렸습니다.... The post The First vLLM Meetup in Korea appeared first on Rebellions.

High signal Matched: korea, rebellions

Rebellions · hardware · 2025-08-21

AI로 예방 중심의 건설 & 플랜트 프로젝트 현장 안전 관리 실현

Score 14

비전 모델과 언어 모델을 결합한 멀티모달, GPU와 NPU를 결합한 하이브리드 인프라로 기존 시스템의 제약을 극복하는 차별화된 AI 기반 안전 관제 시스템, ‘AI 비전 인텔리전스'를 개발한 코오롱베니트의 사례 The post AI로 예방 중심의 건설 & 플랜트 프로젝트 현장 안전 관리 실현 appeared first on Rebellions.

High signal Matched: gpu, npu, rebellions

Rebellions · hardware · 2025-08-21

SOC의 보안 위협 탐지와 대응에 LLM 기반 AI 접목

Score 10

Summary Challenge 현대의 보안관제센터(Security Operation Center, SOC)는 세 가지 과제를 동시에 해결해야 하는 트릴레마(Trilemma) 상황에 놓여 있습니다. 새로운 유형의 공격을... The post SOC의 보안 위협 탐지와 대응에 LLM 기반 AI 접목 appeared first on Rebellions.

High signal Matched: rebellions

Rebellions · hardware · 2025-08-21

학습용 현실 데이터 생성: 생성형 AI로 구현하는 Physical AI

Score 10

Physical AI를 위한 로봇 학습용 데이터 생성과 활용 방안은? Physical AI가 도입되어 실제 환경과 AI가 상호작용하기 위해서는 모델이 매우 정교하게... The post 학습용 현실 데이터 생성: 생성형 AI로 구현하는 Physical AI appeared first on Rebellions.