PIPC presents safe application of synthetic data at technology forum

2024. 5. 31. 09:45
글자크기 설정 파란원을 좌우로 움직이시면 글자크기가 변경 됩니다.

이 글자크기로 변경됩니다.

(예시) 가장 빠른 뉴스가 있고 다양한 정보, 쌍방향 소통이 숨쉬는 다음뉴스를 만나보세요. 다음뉴스는 국내외 주요이슈와 실시간 속보, 문화생활 및 다양한 분야의 뉴스를 입체적으로 전달하고 있습니다.

[Courtesy of The Personal Information Protection Commission]
South Korea’s Personal Information Protection Commission (PIPC) and the Korea Internet & Security Agency (KISA) on Thursday held a seminar on synthetic data, a key technology in the era of a data economy.

Synthetic data refers to newly generated virtual data that only reference the characteristics of various existing information.

It differs from the concept of de-identification processing (pseudonymization or anonymization), which involves transforming some or all personal information.

When appropriately generated, synthetic data can be used without the legal restrictions required to use personal data.

As synthetic data is virtual, it can also be safely used even when sensitive information is involved, addressing concerns about personal information breaches. This makes it a notable privacy protection technology.

The key aspect of synthetic data is to maintain the utility of the actual data while ensuring that individuals cannot be identified.

One representative method is a generative adversarial neural network (GAN), which creates an artificial intelligence (AI) to generate fake data and another AI to distinguish between real and fake data, and trains the two AIs in a competitive manner to create sophisticated synthetic data.

The PIPC plans to release five reference models for synthetic data generation next week, which will cover oral images, safety helmet-wearing images, blood sugar measurement information, telecom membership usage details, and information on corporate shareholders and representatives.

Copyright © 매일경제 & mk.co.kr. 무단 전재, 재배포 및 AI학습 이용 금지

이 기사에 대해 어떻게 생각하시나요?