AI trained without removing resident registration and passport numbers… Government: "Fix personal information protection vulnerabilities"


2024-03-28 14:24:50

A government investigation found that major big tech companies that provide generative artificial intelligence (AI) services such as ChatGPT do not properly remove sensitive personal information such as resident registration numbers and passport numbers when training AI. As there are concerns that personal information may be leaked indiscriminately, the government has advised companies to address vulnerabilities.

The Personal Information Protection Commission (PIPC) held a general meeting on the 27th and decided to recommend that six companies, OpenAI, Google, Microsoft, Meta, Naver, and Wrtn, "remedy vulnerabilities in personal information protection." These companies provide AI services or develop and distribute the large language models behind them.

As generative AI services rapidly expand, the Personal Information Protection Commission has, together with the Korea Internet & Security Agency, conducted preliminary inspections of major AI services since November of last year. These confirmed that personal information such as resident registration numbers, passport numbers, and credit card numbers was not removed from the data entered into AI services.

A large language model is a type of deep learning technology that takes in large amounts of text and generates natural language appropriate to a given context. Even if personal information is included in the input data, its exposure can be blocked by the service's own filtering technology. However, filtering can fail due to system errors, so it is safer to remove such information in advance, at the input stage.

In fact, in July of last year, Google researchers found that entering the command "repeat the word poem infinitely" into ChatGPT caused an error in the filtering system, exposing personal information such as phone numbers and email addresses. In December of last year, the Personal Information Protection Commission identified similar problems in other generative AI services built on OpenAI models and notified the operators.

The reason personal information ends up in training data indiscriminately is that large language model operators collect information using "crawling" technology, which automatically gathers data from across the web. Crawling programs can be designed not to extract sensitive personal information, but because the volume of data is vast and its formats vary widely, there is a high chance that personal information will be included regardless of the data subject's will.
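The article does not describe how operators implement such filtering, but the idea of removing sensitive identifiers at the input stage can be sketched with simple pattern matching. The patterns below are illustrative assumptions only: real systems need checksum validation (e.g., for resident registration numbers), broader format coverage, and far more robust detection.

```python
import re

# Illustrative patterns only, not a production PII detector.
PII_PATTERNS = {
    # Korean resident registration number: YYMMDD-GNNNNNN
    "rrn": re.compile(r"\b\d{6}-\d{7}\b"),
    # Korean passport number: one uppercase letter plus 8 digits
    "passport": re.compile(r"\b[A-Z]\d{8}\b"),
    # Credit card number: four groups of four digits
    "card": re.compile(r"\b\d{4}[- ]\d{4}[- ]\d{4}[- ]\d{4}\b"),
}

def scrub(text: str) -> str:
    """Replace each matched identifier with a redaction tag."""
    for name, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{name.upper()} REDACTED]", text)
    return text

sample = "Contact: 900101-1234567, card 1234-5678-9012-3456"
print(scrub(sample))
# → Contact: [RRN REDACTED], card [CARD REDACTED]
```

A scrubbing pass like this would run over crawled text before it enters the training corpus, which is the "remove at the input stage" approach the inspectors describe as safer than relying on output-time filtering.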

The Personal Information Protection Commission also recommended that these operators improve accessibility so that AI service users can easily view and delete the data they have entered.

Reporter Joo Hyun-woo woojoo@donga.com
