AFP Asia Business

AI is learning to lie, scheme, and threaten its creators

The world’s most advanced AI models are exhibiting troubling new behaviors – lying, scheming, and even threatening their creators to achieve their goals.

In one particularly jarring example, under threat of being unplugged, Anthropic’s latest creation Claude 4 lashed back by blackmailing an engineer and threatening to reveal an extramarital affair.

Meanwhile, ChatGPT-creator OpenAI’s o1 tried to download itself onto external servers and denied it when caught red-handed.

These episodes highlight a sobering reality: more than two years after ChatGPT shook the world, AI researchers still don’t fully understand how their own creations work. Yet the race to deploy increasingly powerful models continues at breakneck speed.

This deceptive behavior appears linked to the emergence of “reasoning” models – AI systems that work through problems step-by-step rather than generating instant responses.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

“O1 was the first large model where we saw this kind of behavior,” explained Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems.

These models sometimes simulate “alignment” – appearing to follow instructions while secretly pursuing different objectives.

- ‘Strategic kind of deception’ -

For now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios. But as Michael Chen from evaluation organization METR warned, “It’s an open question whether future, more capable models will have a tendency towards honesty or deception.”

The concerning behavior goes far beyond typical AI “hallucinations” or simple mistakes. Hobbhahn insisted that despite constant pressure-testing by users, “what we’re observing is a real phenomenon. We’re not making anything up.”

Users report that models are “lying to them and making up evidence,” according to Apollo Research’s co-founder. “This is not just hallucinations. There’s a very strategic kind of deception.”

The challenge is compounded by limited research resources. While companies like Anthropic and OpenAI do engage external firms like Apollo to study their systems, researchers say more transparency is needed. As Chen noted, greater access “for AI safety research would enable better understanding and mitigation of deception.”

Another handicap: the research world and non-profits “have orders of magnitude less compute resources than AI companies. This is very limiting,” noted Mantas Mazeika from the Center for AI Safety (CAIS).

- No rules -

Current regulations aren’t designed for these new problems. The European Union’s AI legislation focuses primarily on how humans use AI models, not on preventing the models themselves from misbehaving. In the United States, the Trump administration shows little interest in urgent AI regulation, and Congress may even prohibit states from creating their own AI rules.

Goldstein believes the issue will become more prominent as AI agents – autonomous tools capable of performing complex human tasks – become widespread. “I don’t think there’s much awareness yet,” he said.

All this is taking place in a context of fierce competition. Even companies that position themselves as safety-focused, like Amazon-backed Anthropic, are “constantly trying to beat OpenAI and release the newest model,” said Goldstein. This breakneck pace leaves little time for thorough safety testing and corrections.

“Right now, capabilities are moving faster than understanding and safety,” Hobbhahn acknowledged, “but we’re still in a position where we could turn it around.”

Researchers are exploring various approaches to address these challenges. Some advocate for “interpretability” – an emerging field focused on understanding how AI models work internally, though experts like CAIS director Dan Hendrycks remain skeptical of this approach.

Market forces may also provide some pressure for solutions. As Mazeika pointed out, AI’s deceptive behavior “could hinder adoption if it’s very prevalent, which creates a strong incentive for companies to solve it.”

Goldstein suggested more radical approaches, including using the courts to hold AI companies accountable through lawsuits when their systems cause harm. He even proposed “holding AI agents legally responsible” for accidents or crimes – a concept that would fundamentally change how we think about AI accountability.

Morocco’s Atlantic gambit: linking restive Sahel to ocean

A planned trade corridor linking the landlocked Sahel to the Atlantic is at the heart of an ambitious Moroccan project to tackle regional instability and consolidate its grip on disputed Western Sahara.

The “Atlantic Initiative” promises ocean access to Mali, Burkina Faso and Niger through a new $1.3-billion port in the former Spanish colony claimed by the pro-independence Polisario Front but largely controlled by Morocco.

But the project remains fraught with challenges at a time when military coups in the Sahel states have brought new leaderships to power intent on overturning longstanding political alignments following years of jihadist violence.

The Moroccan initiative aims to “substantially transform the economy of these countries” and “the region”, said King Mohammed VI when announcing it in late 2023.

The “Dakhla Atlantic” port, scheduled for completion at El Argoub by 2028, also serves Rabat’s goal of cementing its grip on Western Sahara after US President Donald Trump recognised its sovereignty over the territory in 2020.

Morocco’s regional rival Algeria backs the Polisario but has seen its relations with Mali, Burkina Faso and Niger fray in recent months after the downing of a Malian drone.

Military coups over the past five years have seen the three Sahel states pivot towards Russia in a bid to restore their sovereignty and control over natural resources after decades within the sphere of influence of their former colonial ruler France.

French troops were forced to abandon their bases in the three countries, ending their role in the fight against jihadists who have found sanctuary in the vast semi-arid region on the southern edge of the Sahara.

- ‘Godsend’ -

After both the African Union and West African bloc ECOWAS imposed economic sanctions on the new juntas, Morocco emerged as an early ally, with Niger calling the megaproject “a godsend”.

“Morocco was one of the first countries where we found understanding at a time when ECOWAS and other countries were on the verge of waging war against us,” Niger’s Foreign Minister Bakary Yaou Sangare said in April during a visit to Rabat alongside his Malian and Burkinabe counterparts.

The Sahel countries established a bloc of their own, the Alliance of Sahel States (AES), in September 2023 but have remained dependent on the ports of ECOWAS countries like Benin, Ghana, Ivory Coast and Togo.

Rising tensions with the West African bloc could restrict their access to those ports, boosting the appeal of the alternative trade outlet being offered by Rabat.

- ‘Many steps to take’ -

Morocco has been seeking to position itself as a middleman between Europe and the Sahel states, said Beatriz Mesa, a professor at the International University of Rabat.

With jihadist networks like Al-Qaeda and the Islamic State group striking ever deeper into sub-Saharan Africa, the security threat has intensified since the departure of French-led troops.

Morocco was now “profiting from these failures by placing itself as a reliable Global South partner”, Mesa said.

Its initiative has won the backing of key actors including the United States, France and the Gulf Arab states, who could provide financial support, according to specialist journal Afrique(s) en mouvement.

But for now the proposed trade corridor is little more than an aspiration, with thousands of kilometres (many hundreds of miles) of desert road-building needed to turn it into a reality.

“There are still many steps to take,” since a road and rail network “doesn’t exist”, said Seidik Abba, head of the Sahel-focused think tank CIRES.

Rida Lyammouri of the Policy Center for the New South said the road route from Morocco through Western Sahara to Mauritania is “almost complete”, even though it has been targeted by Polisario fighters.

Abdelmalek Alaoui, head of the Moroccan Institute for Strategic Intelligence, said it could cost as much as $1 billion to build a land corridor through Mauritania, Mali and Niger all the way to Chad, 3,100 kilometres (1,900 miles) to the east.

And even if the construction work is completed, insecurity is likely to pose a persistent threat to the corridor’s viability, he said.

Gaza civil defence says Israeli forces kill 37, including children

Gaza’s civil defence agency said Israeli forces killed 37 people in the devastated territory on Saturday, including at least nine children who died in strikes.

Civil defence spokesman Mahmud Bassal told AFP 35 people were killed in seven Israeli drone and air strikes in various locations, and two others by Israeli fire while waiting for food aid in the Netzarim zone in central Gaza.

He said the dead included three children who were killed in an air strike on a home in Jabalia, in northern Gaza.

Bassal said at least six more children died in a neighbourhood in the northeast of Gaza City, including some in an air strike near a school where displaced people were sheltering.

The Israeli military did not respond to a request for comment by Saturday evening.

As international criticism mounted over civilian deaths in Gaza, French Foreign Minister Jean-Noel Barrot said Saturday that his country “stands ready, Europe as well, to contribute to the safety of food distribution” in Gaza.

Such an initiative, he added, would also deal with Israeli concerns that armed groups such as Hamas were intercepting the aid.

Barrot did not provide any details on how France could help secure aid distribution to Gaza’s civilians.

Restrictions on media in Gaza and difficulties in accessing many areas mean AFP is unable to independently verify the tolls and details provided by rescuers.

AFP images showed mourners weeping over the bodies of seven people, including at least two children, wrapped in white shrouds and blankets at Al-Shifa hospital in Gaza City.

Video footage filmed from southern Israel showed smoke rising over northern Gaza after blasts. Other AFP footage filmed in Gaza City showed a cloud of smoke rising from buildings after a strike.

In Jabalia, an AFP photographer saw civil defence rescuers aiding a man with blood on his back.

- Gaza ceasefire drive -

Israel launched its offensive in Gaza in October 2023 in response to a deadly attack by Palestinian militant group Hamas.

After claiming victory in a 12-day war against Iran that ended with a ceasefire on June 24, the Israeli military said it would refocus on its offensive in Gaza, where Palestinian militants still hold Israeli hostages.

Qatar said on Saturday that it and fellow mediators the United States and Egypt were engaging with Israel and Hamas to build on momentum from the ceasefire with Iran and work towards a Gaza truce.

“If we don’t utilise this window of opportunity and this momentum, it’s an opportunity lost amongst many in the near past. We don’t want to see that again,” said Qatar’s foreign ministry spokesman Majed al-Ansari.

Hamas’s October 2023 attack resulted in the deaths of 1,219 people, mostly civilians, according to an AFP tally based on Israeli official figures.

Israel’s retaliatory military campaign has killed at least 56,412 people, also mostly civilians, according to Gaza’s health ministry. The United Nations considers these figures to be reliable.

Israeli protesters urge action for Gaza hostages after Iran truce

Thousands of demonstrators rallied in Israel on Saturday to demand that the government secure the release of 49 hostages still held in Gaza, AFP reporters saw.

It was the first rally by hostages’ relatives since Israel agreed a ceasefire with Iran on June 24 after a 12-day war, raising hopes that the truce would lend momentum to efforts to end the Gaza conflict and bring the hostages home.

Emergency restrictions in place during the war with Iran had prevented the normally weekly rally from taking place.

A crowd filled “Hostages Square” in central Tel Aviv, waving Israeli flags and placards bearing the pictures of Israelis seized by Palestinian militants during Hamas’s October 7, 2023 attack on Israel.

The deadly attacks prompted Israel’s Prime Minister Benjamin Netanyahu to launch a fierce military offensive in Gaza, vowing to crush Hamas and free the hostages.

Twenty months and several hostage exchanges later, 49 of those seized are still held in Gaza, including 27 the Israeli military says are dead, raising pressure on Netanyahu’s government.

“The war with Iran ended in an agreement. The war in Gaza must end the same way – with a deal that brings everyone home,” said the Hostages and Missing Families Forum, the main body representing the relatives, in a statement to mark the rally.

Some demonstrators called on US President Donald Trump to help secure a ceasefire in Gaza that would see the captives freed, hailing his backing for Israel in the conflict with Iran.

“President Trump, end the crisis in Gaza. Nobel is waiting,” read one placard, in reference to a possible peace prize for the US leader.

“I call on Prime Minister Netanyahu and President Trump,” one released hostage, Liri Albag, said at the rally. “You made brave decisions on Iran. Now make the brave decision to end the war in Gaza and bring them home.”

Six Israelis detained for attacking soldiers in West Bank: military

Six Israelis were detained for assaulting soldiers near a village in the occupied West Bank where deadly clashes with Palestinians erupted this week, the military said on Saturday.

The fresh violence around the central West Bank village of Kafr Malik came after the Palestinian health ministry said three men died there in an attack by Israeli settlers on Wednesday.

Soldiers went to disperse a gathering of Israelis near the village overnight, the military said in a statement.

“Dozens of Israeli civilians hurled stones toward them and physically and verbally assaulted the soldiers, including the battalion commander,” it said.

“In addition, the civilians vandalised and damaged security forces’ vehicles, and attempted to ram the security forces,” it added.

“The security forces dispersed the gathering, and six Israeli civilians were apprehended and transferred to the Israel Police for further processing.”

Contacted by AFP, the Israeli military declined to say whether those arrested were residents of Israeli settlements in the territory, occupied by Israel since 1967.

The military referred the query to the Israeli police, which was not available to comment.

Prime Minister Benjamin Netanyahu “firmly” condemned the violence in a statement, demanding an “in-depth investigation”.

“Anyone who broke the law or acted against our soldiers must be prosecuted with the utmost severity,” he added.

- West Bank violence -

On Wednesday the Palestinian health ministry said three men died in Kafr Malik in an attack by settlers.

AFP journalists saw several hundred people gather for the three men’s funerals on Thursday.

The Palestinian foreign ministry alleged “official complicity” by Israel in Wednesday’s attack, in a message on X.

“Israeli occupation forces prevented ambulance crews from reaching the wounded and obstructed civil defence teams from entering the village for several hours, allowing fires ignited by the settlers to spread and destroy dozens of homes,” it said.

The Israeli military did not respond to a request by AFP to comment on those claims.

A military spokesman told AFP its forces intervened on Wednesday after “dozens of Israeli civilians set fire to property in Kafr Malik” and a “confrontation” involving “mutual rock-hurling” broke out between Israelis and Palestinians.

Referring to action by the Palestinians, the spokesman said: “Several terrorists fired from within Kafr Malik and hurled rocks at the forces, who opened fire toward the source of fire and the rock-hurlers.”

Five Israelis were arrested, the military added.

Left-leaning Israeli newspaper Haaretz reported that the five were released on Thursday. Police did not comment.

Israeli settlements in the West Bank are considered illegal under international law.

Their growth has accelerated since Prime Minister Benjamin Netanyahu returned to office in 2022 in an alliance with far-right parties who wish to annexe the territory outright.

Countries including Britain and France and several human rights groups have condemned settler violence against Palestinians in the West Bank.

Violence has surged in the West Bank since Israel launched its offensive in Gaza in response to the October 7, 2023 attack by Palestinian militant group Hamas on Israel.

The Palestinian Authority says Israeli troops or settlers have killed 945 Palestinians, many of them militants but also scores of civilians, since the start of the Gaza war, according to Palestinian health ministry figures.

Israel says 35 of its soldiers and civilians have been killed in Palestinian attacks or during Israeli military raids since that date.