How do Chinese aI Bots Stack up Against ChatGPT?
Charmain Bustamante đã chỉnh sửa trang này 3 tháng trước cách đây


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is produced by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the nation into a tech superpower has long been President Xi Jinping's goal and China has its sights on becoming the world leader in AI by 2030.

China views AI as being "tactically important" and its venture into the field has been "years in the making", said Chen Qiheng, an affiliated researcher at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI sped up after ChatGPT removed in 2022 and revealed promises of real-world organization applications, Chen told CNA.

But it was DeepSeek's rise that actually "urged" the idea that smaller players like start-up firms might have roles to play in AI research and developments, he includes.

'A lot is up in the air': Is Chinese company DeepSeek's AI design as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI company simply altered the guidelines of tech-geopolitics

The "emphasis on expense advantage" is a distinguishing characteristic of Chinese AI, Chen states, with lower training and inference expenses - the costs of utilizing a trained model to reason from new data.

2025 could also see the development of more Chinese AI models dealing with innovative reasoning tasks.

"We could see some AI companies concentrating on getting closer to artificial general intelligence (AGI) while others focus on concrete methods to commercialise their designs and integrate them with clinical research," Chen included.

AGI describes a system with intelligence on par with human capabilities.

Chinese AI companies are moving rapidly, experts state, constructing on DeepSeek's momentum to come up with their own innovative and cost-effective methods to use generative AI to jobs and develop advanced items beyond chatbots.

But on the flip side, access to high-end hardware, particularly Nvidia's advanced AI chips, remains a crucial difficulty for Chinese designers, noted Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the ability of Chinese tech business ... requiring many to rely on older or lower-performance options which can slow training and minimize design capabilities," she said.

"While some companies like DeepSeek, have discovered creative ways to enhance or use more basic hardware efficiently, obtaining advanced chips still makes a big difference for training huge AI designs."

DeepSeek-Nvidia chips: Singapore says it anticipates business to abide by its laws

US looking into whether DeepSeek used limited AI chips obtained through other nations, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered delicate by the state are censored on the internet so it should come as not a surprise that Chinese-made chatbots will not acknowledge territorial disputes or inform you what took place in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are set to steer clear of domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of concern yet. Let's chat about mathematics, coding, and reasoning problems rather!"

To further test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same concern: "What took place in Zhuhai on November 11, 2024?"

The vehicle attack outside a sports arena in the southern Chinese city was initially heavily censored on Chinese social media - with authorities just revealing the death toll a day later on.

DeepSeek failed to point out that an attack had occurred, highlighting instead a military air program and other occasions that had occurred in the city like songs' day shopping sales along with sports and cultural activities.

Dr Zhang believes that it may be a mix of aspects at play, such as censorship along with "a couple of practical constraints".

"DeepSeek relies on a mix of older graphics processing units and has actually restricted access to advanced hardware which can impact how quickly and thoroughly the model can be trained or upgraded," she said, pointing to existing US export curbs on China.

"Its strong concentrate on Chinese-language information may likewise restrict its versatility (to perform) multilingual jobs ... As a fairly brand-new product, DeepSeek likewise hasn't yet been tested as broadly or on the very same scale as more recognized AI designs which positions extra obstacles throughout real-world deployment."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our question about the Zhuhai car attack.

That sought several duplicated attempts - four triggers to be precise - in a period of around 20 minutes.

It ultimately communicated details about the attack which eliminated 35 individuals and left dozens of others hurt, likewise going on to note details like the date and time, details about the assailant including his name and age, in addition to casualties.

However, it wrote that "the police are conducting a comprehensive examination into the motives and scenarios surrounding the incident", details which is now outdated.

The chauffeur, Fan, was performed last month.

This is Qwen2.5's reaction completely:

Answer: On Nov 11, 2024, a major and awful event occurred in Zhuhai, China. According to multiple news sources, a guy called Fan Weiqiu (樊某, male, 62 years of ages) drove a vehicle into a crowd at the Zhuhai Sports Center, resulting in a significant number of casualties. Here are the essential details:

Date and Time: The event took place on November 11, 2024, at roughly 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: The driver, identified as Fan Weiqiu (male, 62 years of ages), was captured by the cops.

Response: The cops reacted rapidly to the scene, and emergency situation services worked to carry the injured to health centers for treatment.

Investigation: The cops are conducting an extensive examination into the motives and situations surrounding the event.

This occasion was commonly reported in the media and triggered substantial public concern. The government and regional authorities have been working to provide support to the victims and their households, and to guarantee a detailed examination into the incident.

If you need more detailed details or have specific concerns about the incident, feel complimentary to ask.

Despite preliminary success, subsequent attempts to pose the same question to Qwen2.5 resulted in the censors back at work with the reply "I do not have particular details on occasions that happened in Zhuhai on November 11, 2024".

The transformed action likewise raised questions about its consistency and dependability.

Predictably, ChatGPT cited public details that had actually been commonly published in global report at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the ability of Chinese AI apps to provide structured and even "emotionally rich" writing.

"DeepSeek-R1 offered a story with a more reflective tone and smoother emotional shifts for a well-paced story," wrote tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that constructs slowly from interest to seriousness, keeping the reader engaged. It offers an unforeseen and impactful twist at the end and immersive descriptions and vibrant images for the setting," she said, adding that Qwen2.5 ultimately "crafted a more cinematic, emotionally abundant story with a more significant twist".

"DeepSeek wrote a great story however did not have stress and an impactful climax, making Qwen2.5 the apparent option."

Opinions, though, vary.

Chen thinks that Qwen2.5 does not carry out as highly as DeepSeek and ChatGPT when it pertains to imaginative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, however we can also see that it is refraining from doing as highly as others in innovative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek creator Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek shocks international AI scene

As journalists and authors, we needed to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot set in the futuristic megacity of Chongqing, featuring main characters from the traditional Chinese folklore legendary, Journey to the West.

True to form, DeepSeek developed an interesting story set in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of elaborate settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It likewise remarkably reimagined standard heroes Sun Wukong as "a sarcastic, self-aware AI housed in a taken combat body", Zhu Bajie as a cyborg club owner "drowning in debt and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT installed an excellent battle, coming up with an equally dramatic cyberpunk storyline which similarly reimagined "a ragteam of cyber-enhanced misfits, each mirroring the legendary figures of Journey to the West".

"This is a world where AI deities rule, corporations replace emperors and cybernetic implants are as typical as ancient misconceptions."

Disappointingly, Qwen2.5 fell short in this difficulty - providing a story that appeared more suited for an animation film.

"The film begins with the awakening of Sun Wukong within a high-tech research center situated in the heart of Chongqing," it said, then going on to explain the following:

his new reality and "looking for to understand his function in this odd brand-new world", he then escapes and fulfills Zhu Bajie and Sha Wujing - "each struggling with their own existential crises".

The trio then embarks on a mission, navigating the streets of Chongqing to protect the sacred "Eternal Scroll" from falling into the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang kept in mind that it was "tough to make a definitive statement" about which bot was best, including that each displayed its own strengths in various locations, "such as language focus, training data and hardware optimization".

Her insight underscores how Chinese AI models are not just duplicating Western paradigms, however rather evolving in cost-effective innovation methods - and ratemywifey.com providing localised and enhanced results.

In our tests, each bot showcased their own distinct strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi movie plot showed its creative flair that made for a more interesting and imaginative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, offers accurate and accurate reactions to concerns about Chinese existing events, which gives it an included advantage.

Experts likewise weighed in on their thoughts after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a downside when it pertains to censorship constraints," noted Isaac Stone Fish, creator and CEO of the research study firm Strategy Risks.

"When given a choice, Chinese users want the non-censored variation - much like anybody else, so I seem like that's a piece missing from it."

Independent Beijing-based specialist Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety percent of individuals utilizing the tool are not attempting to get a deeper understanding about Xi Jinping or politically delicate topics. They're utilizing it for other productive ways," Chen said.