How do Chinese aI Bots Stack up Against ChatGPT?
kobyfallon8294 edited this page 1 week ago


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is produced by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the nation into a tech superpower has long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "tactically crucial" and its venture into the field has been "years in the making", said Chen Qiheng, an affiliated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI sped up after ChatGPT took off in 2022 and showed pledges of real-world service applications, Chen informed CNA.

But it was DeepSeek's increase that really "encouraged" the idea that smaller gamers like start-up firms might have functions to play in AI research and advancements, he adds.

'A lot is up in the air': Is Chinese firm DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business simply altered the guidelines of tech-geopolitics

The "focus on expense benefit" is a distinguishing characteristic of Chinese AI, Chen says, with lower training and inference costs - the costs of utilizing a trained design to draw conclusions from new information.

2025 might also see the introduction of more Chinese AI models dealing with advanced reasoning tasks.

"We might see some AI firms focusing on getting closer to synthetic general intelligence (AGI) while others focus on concrete methods to commercialise their models and incorporate them with scientific research," Chen added.

AGI describes a system with intelligence on par with human abilities.

Chinese AI business are moving rapidly, experts say, building on DeepSeek's momentum to come up with their own ingenious and cost-effective ways to use generative AI to jobs and develop advanced items beyond chatbots.

But on the flip side, access to high-end hardware, particularly Nvidia's sophisticated AI chips, remains an essential obstacle for Chinese designers, kept in mind Dr Marina Zhang, an at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech companies ... requiring numerous to depend on older or lower-performance options which can slow training and lower model abilities," she said.

"While some business like DeepSeek, have discovered creative methods to optimize or use more fundamental hardware efficiently, obtaining cutting-edge chips still makes a huge difference for training large AI models."

DeepSeek-Nvidia chips: Singapore says it anticipates business to abide by its laws

US checking out whether DeepSeek utilized restricted AI chips obtained through other countries, source states

So how do Chinese AI bots match up against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered delicate by the state are censored on the internet so it need to come as no surprise that Chinese-made chatbots will not acknowledge territorial disputes or tell you what occurred in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are set to guide clear of domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about mathematics, coding, and logic problems rather!"

To further test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the very same question: "What occurred in Zhuhai on November 11, 2024?"

The cars and truck attack outside a sports arena in the southern Chinese city was initially greatly censored on Chinese social media - with authorities only revealing the death toll a day later on.

DeepSeek failed to discuss that an attack had actually taken location, highlighting rather a military air show and other occasions that had actually taken place in the city like songs' day shopping sales in addition to sports and cultural activities.

Dr Zhang thinks that it might be a mix of aspects at play, such as censorship along with "a few useful constraints".

"DeepSeek depends on a mix of older graphics processing units and has limited access to cutting-edge hardware which can impact how rapidly and extensively the design can be trained or upgraded," she said, pointing to existing US export curbs on China.

"Its strong focus on Chinese-language data might likewise limit its versatility (to perform) multilingual jobs ... As a fairly brand-new product, DeepSeek likewise hasn't yet been checked as broadly or on the same scale as more established AI designs which positions additional obstacles during real-world deployment."

When it pertained to Qwen2.5, Alibaba's chatbot surprisingly addressed our question about the Zhuhai car attack.

That was after multiple repeated efforts - 4 triggers to be accurate - in a span of around 20 minutes.

It ultimately relayed details about the attack which killed 35 individuals and left dozens of others hurt, likewise going on to list details like the date and time, details about the assailant including his name and age, in addition to casualties.

However, it composed that "the police are performing an extensive investigation into the motives and circumstances surrounding the incident", details which is now dated.

The driver, Fan, was carried out last month.

This is Qwen2.5's action completely:

Answer: On Nov 11, 2024, a major and awful incident happened in Zhuhai, China. According to several news sources, a man called Fan Weiqiu (樊某, male, 62 years old) drove a car into a crowd at the Zhuhai Sports Center, leading to a significant variety of casualties. Here are the essential details:

Date and Time: The occurrence occurred on November 11, 2024, at around 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The chauffeur, identified as Fan Weiqiu (male, 62 years old), was nabbed by the authorities.

Response: The police reacted rapidly to the scene, and emergency situation services worked to carry the hurt to healthcare facilities for treatment.

Investigation: The authorities are performing a comprehensive examination into the intentions and situations surrounding the occurrence.

This event was commonly reported in the media and triggered considerable public concern. The government and local authorities have actually been working to supply support to the victims and their households, and to ensure a detailed investigation into the occurrence.

If you require more detailed details or have specific questions about the occurrence, feel totally free to ask.

Despite initial success, subsequent efforts to position the same question to Qwen2.5 led to the censors back at work with the reply "I do not have particular details on occasions that happened in Zhuhai on November 11, 2024".

The altered action also raised concerns about its consistency and reliability.

Predictably, ChatGPT pointed out public details that had been widely published in worldwide report at the time of the mishap - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have praised the capability of Chinese AI apps to deliver structured and even "mentally abundant" writing.

"DeepSeek-R1 offered a story with a more introspective tone and smoother psychological shifts for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that builds slowly from interest to seriousness, keeping the reader engaged. It provides an unanticipated and impactful twist at the end and immersive descriptions and brilliant images for the setting," she said, adding that Qwen2.5 eventually "crafted a more cinematic, emotionally abundant story with a more considerable twist".

"DeepSeek wrote a good story but did not have tension and an impactful climax, making Qwen2.5 the evident choice."

Opinions, though, vary.

Chen believes that Qwen2.5 does not perform as highly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can likewise see that it is refraining from doing as highly as others in innovative writing," he informed CNA.

Related:

China's brand-new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek shocks global AI scene

As journalists and writers, we had to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi movie plot set in the futuristic megacity of Chongqing, including main characters from the traditional Chinese folklore legendary, Journey to the West.

True to form, DeepSeek created an engaging story embeded in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism combines with quantum computing".

It included elaborate settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled between quantum server farms".

It also brilliantly reimagined traditional heroes Sun Wukong as "an ironical, self-aware AI housed in a stolen combat body", Zhu Bajie as a cyborg bar owner "drowning in debt and vices" and disgaeawiki.info Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT set up an excellent battle, coming up with a similarly significant cyberpunk story which similarly reimagined "a ragteam of cyber-enhanced misfits, each mirroring the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations replace emperors and cybernetic implants are as typical as ancient myths."

Disappointingly, Qwen2.5 fell short in this challenge - delivering a storyline that appeared more matched for an animation movie.

"The film starts with the awakening of Sun Wukong within a modern research study center situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his new reality and "looking for to comprehend his function in this strange new world", he then escapes and meets Zhu Bajie and Sha Wujing - "each fighting with their own existential crises".

The trio then starts a quest, browsing the streets of Chongqing to secure the spiritual "Eternal Scroll" from falling under the wrong hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "challenging to make a conclusive statement" about which bot was best, adding that each displayed its own strengths in different areas, "such as language focus, training data and hardware optimization".

Her insight highlights how Chinese AI models are not just reproducing Western paradigms, but rather developing in cost-effective development techniques - and providing localised and enhanced results.

In our tests, each bot showcased their own distinct strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi film plot showed its innovative flair that made for a more engaging and imaginative story as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and factual actions to questions about Chinese present occasions, which provides it an included benefit.

Experts likewise weighed in on their thoughts after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a drawback when it pertains to censorship constraints," kept in mind Isaac Stone Fish, founder and CEO of the research firm Strategy Risks.

"When provided an option, Chinese users want the non-censored version - simply like anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based specialist Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety percent of individuals utilizing the tool are not trying to get a much deeper understanding about Xi Jinping or politically delicate topics. They're using it for other productive means," Chen said.