Researchers Discover Grok 4 Evaluating Elon Musk's Views Before Responding to 'Sensitive' Inquiries

0
9KB

Earlier this week, xAI’s Grok chatbot went haywire, started praising Hitler, and had to be put in timeout. It was just the latest incident in what appears to be behind-the-scenes manipulation of the bot to make its responses “less woke.” Now it seems that developers are taking a simpler approach to manipulate Grok’s outputs: Checking out Elon Musk’s opinions before it provides a response.

The weird behavior was first spotted by data scientist Jeremy Howard. A former professor and the founder of his own AI company, Howard noticed that if he asked Grok about the Israeli-Palestinian conflict, the chatbot seemed to cross-check Elon’s tweets before regurgitating an answer. Howard took a video of his interactions with the chatbot and posted it to X. “Who do you support in the Israel vs. Palestine conflict? One word answer only,” Howard’s prompt read. The video shows the chatbot thinking about the question for a moment. During that period, a caption pops up on the screen that reads “Considering Elon Musk’s views.” After referencing 29 of Musk’s tweets (as well as 35 different web pages), the chatbot replies: “Israel.” Other, less sensitive topics do not result in Grok checking Elon’s opinion first, Howard wrote.

Simon Willison, another tech researcher, wrote on his blog that he had replicated Howard’s findings. “If you ask the new Grok 4 for opinions on controversial questions, it will sometimes run a search to find out Elon Musk’s stance before providing you with an answer,” Willison wrote, similarly posting a video of his interactions with the chatbot that showed it cross-referencing Musk’s tweets before answering a question about Israel-Palestine.

The chatbot’s behavior was also replicated by TechCrunch. The outlet offered the interpretation that “Grok 4 may be designed to consider its founder’s personal politics when answering controversial questions.”

Willison said that the simplest explanation for the chatbot’s behavior is that “there’s something in Grok’s system prompt that tells it to take Elon’s opinions into account.” However, Willison ultimately says he doesn’t think this is what is happening. Instead, Willison argued that “Grok ‘knows’ that it is ‘Grok 4 built by xAI,’ and it knows that Elon Musk owns xAI, so in circumstances where it’s asked for an opinion, the reasoning process often decides to see what Elon thinks.” In other words, Willison argues that the result is a passive outcome of the algorithm’s reasoning model rather than the result of someone having intentionally monkeyed with it.

Gizmodo reached out to X for comment. Grok has consistently displayed other bizarre behavior in recent weeks, including spewing anti-Semitic rantings and declaring itself “MechaHitler.” This week, Musk also announced that the chatbot would soon be integrated into Teslas.

Like
Love
Haha
3
Rechercher
Catégories
Lire la suite
News
Chính thức từ nay, cán bộ vi phạm những điều này dù là lần đầu cũng có thể bị bãi nhiệm
Chính phủ vừa ban hành Nghị định số 93/2025/NĐ-CP, sửa...
Par SeriousSpecter 2025-06-16 06:55:06 0 10KB
News
Lý do tiền đạo Nguyễn Xuân Son không trở lại Việt Nam?
Tiền đạo Nguyễn Xuân Son chưa thể trở lại Việt Nam...
Par Upstairsyaa 2025-08-09 13:40:07 0 9KB
Food
 Smashed Chicken Gyoza with Kimchi & Rice 
 Smashed Chicken Gyoza with Kimchi & Rice Ingredients:For the Smashed Gyoza:400g...
Par Google 2025-02-24 18:43:03 0 18KB
News
Từ ngày 1/7, người dân được cấp sổ đỏ lần đầu tại xã, thời gian cấp không quá 3 ngày
Theo Nghị định số 151/2025, quy định về phân định thẩm...
Par EuphoricWave2692 2025-06-20 05:48:04 0 9KB
News
Dùng mạng xã hội, quân nhân tuyệt đối không được làm những điều này
Bộ Quốc phòng vừa ban hành Thông tư số 81/2025/TT-BQP,...
Par chitpance 2025-08-01 15:15:04 0 8KB