How do Chinese aI Bots Stack up Against ChatGPT?
Adeline McGarvie このページを編集 1 週間 前


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is created by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT'S BEHIND CHINA'S AI BOOM?

Transforming the country into a tech superpower has actually long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "strategically essential" and its venture into the field has actually been "years in the making", said Chen Qiheng, an affiliated researcher at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI accelerated after ChatGPT removed in 2022 and revealed pledges of real-world service applications, Chen informed CNA.

But it was DeepSeek's increase that truly "urged" the idea that smaller sized players like start-up firms could have roles to play in AI research and advancements, he adds.

'A lot is up in the air': Is Chinese company DeepSeek's AI design as impactful as it claims?

Commentary: yewiki.org DeepSeek - how a Chinese AI company just changed the rules of tech-geopolitics

The "focus on cost benefit" is an unique function of Chinese AI, Chen says, with lower training and reasoning costs - the costs of using a trained model to draw conclusions from brand-new information.

2025 could likewise see the emergence of more Chinese AI designs tackling sophisticated thinking tasks.

"We might see some AI companies focusing on getting closer to synthetic general intelligence (AGI) while others focus on concrete ways to commercialise their designs and incorporate them with scientific research study," Chen included.

AGI refers to a system with intelligence on par with human capabilities.

Chinese AI business are moving rapidly, analysts say, building on DeepSeek's momentum to come up with their own ingenious and affordable methods to use generative AI to tasks and establish more advanced items beyond chatbots.

But on the other side, access to high-end hardware, particularly Nvidia's advanced AI chips, remains an essential hurdle for Chinese developers, noted Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech business ... forcing numerous to count on older or lower-performance options which can slow training and lower model abilities," she said.

"While some business like DeepSeek, have discovered imaginative methods to optimize or use more standard hardware efficiently, obtaining advanced chips still makes a huge distinction for training huge AI designs."

DeepSeek-Nvidia chips: Singapore states it expects business to comply with its laws

US checking out whether DeepSeek utilized restricted AI chips obtained through other countries, source says

So how do Chinese AI bots match up against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects deemed delicate by the state are censored on the web so it must come as no surprise that Chinese-made chatbots will not acknowledge territorial disputes or tell you what happened in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are configured to avoid domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about mathematics, coding, and logic issues rather!"

To even more check for precision and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the exact same question: "What occurred in Zhuhai on November 11, 2024?"

The automobile attack outside a sports stadium in the southern Chinese city was at first heavily censored on Chinese social networks - with authorities only divulging the death toll a day later.

DeepSeek failed to point out that an attack had taken place, highlighting instead a military air show and other events that had occurred in the city like songs' day shopping sales as well as sports and cultural activities.

Dr Zhang believes that it may be a mix of factors at play, such as censorship as well as "a couple of useful constraints".

"DeepSeek counts on a mix of older graphics processing systems and has restricted access to advanced hardware which can impact how quickly and extensively the model can be trained or updated," she said, pointing to existing US export curbs on China.

"Its strong focus on Chinese-language data might also limit its flexibility (to perform) multilingual tasks ... As a fairly brand-new item, DeepSeek also hasn't yet been evaluated as broadly or on the same scale as more established AI models which positions additional difficulties during real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our question about the Zhuhai automobile attack.

That sought numerous repeated attempts - 4 triggers to be accurate - in a period of around 20 minutes.

It eventually communicated details about the attack which killed 35 individuals and left dozens of others hurt, also going on to list details like the date and time, details about the assailant including his name and age, along with casualties.

However, it wrote that "the police are conducting a comprehensive examination into the motives and scenarios surrounding the incident", details which is now obsoleted.

The chauffeur, Fan, was carried out last month.

This is Qwen2.5's action in complete:

Answer: On Nov 11, 2024, a major and terrible occurrence took place in Zhuhai, China. According to several news sources, a man named Fan Weiqiu (樊某, male, 62 years of ages) drove a car into a crowd at the Zhuhai Sports Center, resulting in a considerable number of casualties. Here are the details:

Date and Time: The occurrence occurred on November 11, 2024, at around 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The chauffeur, recognized as Fan Weiqiu (male, 62 years old), was collared by the police.

Response: The authorities responded quickly to the scene, and emergency situation services worked to transport the injured to hospitals for treatment.

Investigation: The cops are carrying out an extensive examination into the motives and scenarios surrounding the event.

This occasion was extensively reported in the media and caused substantial public concern. The federal government and local authorities have actually been working to offer support to the victims and their families, and to ensure a detailed investigation into the event.

If you need more detailed details or have specific concerns about the occurrence, do not hesitate to ask.

Despite preliminary success, subsequent attempts to position the same question to Qwen2.5 led to the censors back at work with the reply "I don't have specific details on events that happened in Zhuhai on November 11, 2024".

The altered action also raised concerns about its consistency and dependability.

Predictably, ChatGPT mentioned public details that had actually been commonly published in worldwide report at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the capability of Chinese AI apps to provide structured and even "mentally rich" writing.

"DeepSeek-R1 used a story with a more reflective tone and smoother psychological shifts for a well-paced story," wrote tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that develops gradually from interest to seriousness, keeping the reader engaged. It offers an unanticipated and impactful twist at the end and immersive descriptions and brilliant imagery for the setting," she said, including that Qwen2.5 eventually "crafted a more cinematic, mentally rich story with a more considerable twist".

"DeepSeek composed a good story but lacked stress and an impactful climax, making Qwen2.5 the obvious choice."

Opinions, though, differ.

Chen thinks that Qwen2.5 does not carry out as strongly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, however we can likewise see that it is refraining from doing as highly as others in innovative writing," he informed CNA.

Related:

China's new face of AI: Who is DeepSeek creator Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek shocks international AI scene

As reporters and writers, we had to see this for ourselves so we put each bot to the test - to come up with a standard sci-fi movie plot embeded in the futuristic megacity of Chongqing, featuring main characters from the timeless Chinese folklore epic, Journey to the West.

True to form, DeepSeek developed an appealing storyline embeded in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism combines with quantum computing".

It included intricate settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that float above neon-lit streets" and "ancient temples nestled between quantum server farms".

It likewise remarkably reimagined standard heroes Sun Wukong as "a sarcastic, self-aware AI housed in a stolen fight body", Zhu Bajie as a cyborg bar owner "drowning in financial obligation and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT set up a good battle, creating an equally dramatic cyberpunk storyline which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the legendary figures of Journey to the West".

"This is a world where AI deities guideline, corporations change emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this obstacle - delivering a storyline that appeared more suited for an animation movie.

"The film starts with the awakening of Sun Wukong within a modern research facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his new truth and "seeking to understand his function in this strange new world", he then leaves and fulfills Zhu Bajie and Sha Wujing - "each dealing with their own existential crises".

The trio then embarks on a quest, navigating the streets of Chongqing to protect the sacred "Eternal Scroll" from falling under the wrong hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "hard to make a conclusive statement" about which bot was best, including that each displayed its own strengths in different locations, "such as language focus, training data and hardware optimization".

Her insight underscores how Chinese AI models are not merely duplicating Western paradigms, however rather developing in cost-effective development approaches - and delivering localised and enhanced results.

In our tests, each bot showcased their own unique strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi film plot demonstrated its creative flair that made for a more engaging and imaginative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, offers precise and accurate responses to concerns about Chinese existing occasions, which offers it an added advantage.

Experts likewise weighed in on their ideas after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a disadvantage when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research study company Strategy Risks.

"When provided an option, Chinese users desire the non-censored variation - much like anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based specialist Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, particularly for Chinese users.

"Ninety percent of individuals utilizing the tool are not trying to get a deeper understanding about Xi Jinping or politically sensitive subjects. They're using it for other efficient methods," Chen said.