SKT unveils Korean-specialized VLM on Hugging Face

2025. 7. 30. 11:30

SK telecom Co. announced Tuesday that it has open-sourced a new vision-language model (VLM) and document parsing technology for large language model (LLM) training on Hugging Face.

Both are based on the company’s AI platform A.X.

The newly released A.X 4.0 VL Light is a mid-sized model trained on large-scale multimodal Korean datasets. It is designed to interpret complex industrial data such as tables, graphs, and engineering schematics.

Built on the A.X 4.0 Light architecture, the model achieved an average score of 79.4 on Korean visual benchmarks, outperforming China’s Qwen2.5-VL-32B.

SK telecom also introduced the A.X Encoder, a high-speed data processing tool optimized for long documents.

Designed to support LLM training, the encoder delivers up to three times faster inference and double the training speed compared to existing models.

With 149 million parameters, it scored 85.47 on a natural language understanding benchmark, placing it at a state-of-the-art (SOTA) level globally.

With the release of these two technologies, SK telecom has unveiled a total of six models this month alone, signaling a strong push toward the national AI foundation model initiative.

Copyright © Maeil Business Newspaper & mk.co.kr. Unauthorized reproduction, redistribution, and use for AI training prohibited.