Kakao's open-source AI model can interpret both words and pictures
Published: 19 Jan. 2024, 17:37
Updated: 19 Jan. 2024, 19:09
- LEE JAE-LIM
- [email protected]
Kakao unveiled its multimodal AI model “Honeybee” for the first time on Friday at a conference hosted by the Ministry of Science and ICT. The tech giant's hyperscale language model KoGPT 2.0, however, remains under wraps.
Kakao’s incoming CEO Chung Shin-a presented the source code as she discussed the company's upcoming plans for developing AI models and services.
C-suite executives from fields including platforms, telecommunications, beauty, TV and robotics attended the conference, which focused on government policies and collaborations related to AI. Executives from Samsung, LG, Doosan Robotics, Naver and Amorepacific were also in attendance.
Honeybee's code base was released to developers through GitHub on the same day, according to Kakao’s research subsidiary Kakao Brain.
The source code itself is not a large language model (LLM), but rather a module that can be plugged into other LLMs. LLMs that implement it would become multimodal, gaining the ability to comprehend both image and text prompts.
For instance, if a user feeds a picture of two basketball players on a court to a Honeybee-integrated LLM and asks “how many times did the player on the left win?” in English, the model could comprehend both the image and the text to produce an appropriate response.
Honeybee achieved top scores in performance tests on several global multimodal evaluation benchmarks, including MME, MMBench and SEED-Bench.
Kakao Brain believes that Honeybee could be an innovative education tool, as users can interact with it by submitting an image together with a text query, though the exact use cases for Honeybee have yet to be officially specified.
“We are deliberating on adapting Honeybee to a variety of services,” said Kakao Brain CEO Kim Il-do in a statement. “We will seamlessly put more effort on research and development (R&D) to come up with a more perfected AI model.”
Kakao is a relative latecomer to the global race for AI supremacy that OpenAI's ChatGPT catalyzed last year. Kakao initially promised to release KoGPT 2.0 last year but has since repeatedly postponed its release amid various allegations related to internal friction and questionable dealings surrounding its acquisition of K-pop agency SM Entertainment.
Other Korean companies moved earlier: Naver, the operator of Korea’s largest portal site, and LG AI Research rolled out their LLMs, HyperCLOVA X and Exaone, respectively, last year. Those models are being adapted to a variety of services across online platforms and financial firms.
BY LEE JAE-LIM [[email protected]]
with the Korea JoongAng Daily