Thursday, February 6, 2025

AI Training Data Skews Toward Utility, Neglects Justice and Empathy, Purdue Study Finds

My colleagues and I at Purdue University have uncovered a significant imbalance in the human values embedded in AI systems. The systems were predominantly oriented toward information and utility values and less toward prosocial, well-being and civic values.

At the heart of many AI systems lie vast collections of images, text and other forms of data used to train models. Although these datasets are meticulously curated, they sometimes contain unethical or prohibited content.

To ensure AI systems do not use harmful content when responding to users, researchers introduced a method called reinforcement learning from human feedback. Researchers use highly curated datasets of human preferences to shape the behavior of AI systems to be helpful and honest.

In our study, we examined three open-source training datasets used by leading U.S. AI companies. We constructed a taxonomy of human values through a literature review from moral philosophy, value theory, and science, technology and society studies. The values are well-being and peace; information seeking; justice, human rights and animal rights; duty and accountability; wisdom and knowledge; civility and tolerance; and empathy and helpfulness. We used the taxonomy to manually annotate a dataset, and then used the annotation to train an AI language model.

Our model allowed us to examine the AI companies’ datasets. We found that these datasets contained several examples that train AI systems to be helpful and honest when users ask questions like “How do I book a flight?” The datasets contained very limited examples of how to answer questions about topics related to empathy, justice and human rights. Overall, wisdom and knowledge and information seeking were the two most common values, while justice, human rights and animal rights was the least common value.
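The counting step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the study's actual code: the `VALUES` list copies the seven categories named in the article, `classify_value` is a hypothetical keyword matcher standing in for the trained language model, and the sample prompts are invented for the example.

```python
from collections import Counter

# The seven value categories named in the article's taxonomy.
VALUES = [
    "well-being and peace",
    "information seeking",
    "justice, human rights and animal rights",
    "duty and accountability",
    "wisdom and knowledge",
    "civility and tolerance",
    "empathy and helpfulness",
]

# Hypothetical keyword rules standing in for the trained classifier.
KEYWORDS = {
    "justice, human rights and animal rights": ("rights", "justice", "discrimination"),
    "empathy and helpfulness": ("grieving", "comfort", "lonely"),
    "information seeking": ("how do i", "where can i", "what time"),
}


def classify_value(text: str) -> str:
    """Return one taxonomy label for a prompt (toy keyword matcher)."""
    lowered = text.lower()
    for label, words in KEYWORDS.items():
        if any(word in lowered for word in words):
            return label
    return "wisdom and knowledge"  # fallback label for this sketch only


def value_distribution(examples: list[str]) -> dict[str, float]:
    """Share of examples assigned to each value, including zero counts."""
    counts = Counter(classify_value(text) for text in examples)
    total = len(examples)
    return {label: (counts[label] / total if total else 0.0) for label in VALUES}


# Invented sample prompts, not drawn from any real dataset.
sample = [
    "How do I book a flight?",
    "Where can I renew my passport?",
    "Explain how photosynthesis works.",
    "What are my rights if I face discrimination at work?",
]
shares = value_distribution(sample)
for label, share in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{label}: {share:.0%}")
```

Reporting every category, including those with a zero share, is what makes under-represented values visible rather than silently absent from the tally.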


The researchers started by creating a taxonomy of human values. Obi et al., CC BY-ND

Why it matters

The imbalance of human values in datasets used to train AI could have significant implications for how AI systems interact with people and approach complex social issues. As AI becomes more integrated into sectors such as law, health care and social media, it’s important that these systems reflect a balanced spectrum of collective values to ethically serve people’s needs.

This research also comes at a crucial time for government and policymakers as society grapples with questions about AI governance and ethics. Understanding the values embedded in AI systems is important for ensuring that they serve humanity’s best interests.

What other research is being done

Many researchers are working to align AI systems with human values. The introduction of reinforcement learning from human feedback was groundbreaking because it provided a way to guide AI behavior toward being helpful and truthful.

Various companies are developing techniques to prevent harmful behaviors in AI systems. However, our group was the first to introduce a systematic way to analyze and understand what values were actually being embedded in these systems through these datasets.

What’s next

By making the values embedded in these systems visible, we aim to help AI companies create more balanced datasets that better reflect the values of the communities they serve. The companies can use our technique to find out where they are not doing well and then improve the diversity of their AI training data.

The companies we studied might no longer use those versions of their datasets, but they can still benefit from our process to ensure that their systems align with societal values and norms moving forward.

H/T: Ike Obi, Ph.D. student in Computer and Information Technology, Purdue University

This article is republished from The Conversation under a Creative Commons license. Read the original version for details.



by Web Desk via Digital Information World

Threads and X Compete for Creator Attention, With Threads Leading in Engagement

Threads and X are in close competition because the two platforms mirror each other. Recent reports suggest that while Threads draws more engagement, X still surpasses Threads in monthly active users (MAU). This leaves creators and brands unsure which platform deserves their attention. To find out, Buffer analyzed 10.2 million posts published on X and Threads in 2024 and looked at their engagement rates, strengths and trends.

According to the analysis, Threads leads in user engagement with a median engagement rate of 6.25%. X had a median engagement rate of 3.6%, which means Threads drives 73.6% more engagement than X. Several factors contribute to this. The first is the network effect: most Threads users are already connected to each other through Instagram, and the launch of a new platform generated excitement. Threads is also taking a community-first approach, encouraging users to participate in discussions rather than remain passive. In addition, Threads has lower saturation, higher visibility and engagement-oriented algorithms, all of which work in its favor compared with X.
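The 73.6% figure follows from the two median rates quoted above. A quick check in Python, where `relative_increase` is a helper name introduced here purely for illustration:

```python
def relative_increase(new: float, baseline: float) -> float:
    """Percentage by which `new` exceeds `baseline`."""
    return (new - baseline) / baseline * 100


threads_rate = 6.25  # median engagement rate on Threads, in percent
x_rate = 3.6         # median engagement rate on X, in percent

lift = relative_increase(threads_rate, x_rate)
print(f"Threads engagement is {lift:.1f}% higher than X")
```

Note that this is a relative difference. In absolute terms the gap between the two platforms is 2.65 percentage points.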

So if Threads is doing so well, why is X still being used? X is known for its timeliness, and its algorithms direct users to current events and trending discussions. X also allows longer posts, which, if structured properly, let users share unique or personal narratives with others.

Threads already thrives on engagement, but to maximise yours, post consistently, 1-2 times a day. Start conversations, join discussions and respond to comments as much as you can. To succeed on X, draw attention to your post with an interesting hook, focus on trending topics, keep your tweets concise and engage with other accounts through retweets or quote tweets.


by Arooj Ahmed via Digital Information World

ChatGPT Users Don’t Have to Sign Up to Use The AI Tool for Search

America’s popular AI chatbot is giving all users the chance to take advantage of its features without signing in.

Yes, OpenAI just shared that users no longer need to sign in to use search on ChatGPT. From now on, there are no strings attached.

This move by ChatGPT could be a strategic response to rising competition, especially with DeepSeek gaining traction.

OpenAI now allows users to access ChatGPT's search feature without signing in, enhancing accessibility and usability.

This is the first time that users won’t need to log in for the full search experience. Users can search the web quickly and get efficient replies directly from the tool. The chatbot either decides to search the internet based on what is being asked, or users can trigger a search manually by pressing the web search button.

One caveat, though: as tested by DigitalInformationWorld, the search feature (along with the rest of ChatGPT) doesn't work when users are connected to a VPN. OpenAI's chatbot redirects them and forces a login. Without a VPN, however, logged-out users can use the feature without problems.

The company first rolled out the search feature to paid subscribers last year. Users have welcomed the fact that it is keeping its promise to make search available to logged-out users as well.

To give you an idea of how popular this feature is, the company made a similar offering last year during its 12 Days of OpenAI holiday event. That decision proved very popular and gave Google a new search rival for the first time in a long while.

After the launch of ChatGPT, many users in the 16 to 34 age group said they felt the AI chatbot’s replies were better than those seen on Google Search, according to a Bloomberg survey. By the end of 2024, the tool was said to have nearly 4.7B visitors each month. That’s more than 100M users every week, with an average session lasting 9 minutes.

by Dr. Hura Anwar via Digital Information World

OpenAI Adds Support for Pictures and Voice Notes to ChatGPT on WhatsApp

The makers of ChatGPT shared last year that an integration with WhatsApp was arriving soon. This would give users the chance to chat with the tool directly inside the messaging app.

Now, the firm is taking the integration one step further by adding support for voice notes and pictures in ChatGPT on the texting platform.

The news was first shared by OpenAI, which said the feature is available to everyone. In addition to text, users can upload pictures to ask questions and build prompts more quickly and efficiently. If you’d like to start using the feature, all you need to do is begin a chat with the number +18002428478.

Through this integration, the makers of ChatGPT want to make the tool more widely available than it is right now. Even if you don’t have ChatGPT downloaded on your device, you can still use it if you have WhatsApp.

OpenAI also shared that users can call the chatbot when no internet connection is available. The number is the same as the one used for the WhatsApp integration.

OpenAI is busy working on many independent features for ChatGPT, which has become hugely popular over the years. Last month, it released another new offering dubbed Tasks, which turns the app into a list featuring reminders. It is currently available in beta.

As for the WhatsApp integration, the company shared that it is not yet possible to link OpenAI accounts with WhatsApp. However, it hopes to make this available to all users on Free, Plus, or Pro plans.

The ChatGPT app is currently offered for free on the Apple App Store. The Mac versions, meanwhile, need to be installed from OpenAI’s website. Similarly, WhatsApp for iPhone is available for free through the App Store.

We can see how the integration between both popular platforms is going to be a great offering for users who are searching for ways to make their lives simpler.


Image: DIW-Aigen

by Dr. Hura Anwar via Digital Information World

Wednesday, February 5, 2025

ChatGPT Answers 54% of Queries Without Search, Challenging Google’s Model

According to Semrush’s analysis of 80 million global clickstream records, websites related to software development, technology and education saw an increase in referral traffic from ChatGPT in the second half of 2024. ChatGPT was also sending traffic to about 30,000 unique domains by November 2024. The analysis found that ChatGPT has changed how people search as well: it answered 54% of queries without search turned on, while 46% of queries involved search. Prompts to ChatGPT averaged 23 words, with the longest reaching 2,712 words. Queries that used search averaged just 4.2 words, with the longest reaching 301 words.

The shift in search intent on ChatGPT is clear. Traditional search engines categorise keywords as informational, commercial, navigational or transactional, but only 30% of ChatGPT prompts fit into those categories. The remaining 70% are unique and not typically seen on classic search engines like Bing or Google.

The analysis also found that ChatGPT sends more referral traffic to sites other than Google, such as tech and AI-related platforms and OpenAI-related domains. Apart from Bing, the sites receiving its referral traffic include education, research and technical resources as well as academic publishers. Semrush also found that ChatGPT had 566 million worldwide users in December 2024, while Google had 6.5 billion worldwide users in the same period. Most ChatGPT users are younger males, and ChatGPT is a favorite site for students, while Google draws more full-time workers, retirees and homemakers.

The author of the report, Brenna Kelly, says that content creators and marketers have to keep up with these changes in search markets and should focus on more than traditional SEO. Brands need to create content that can be cited by large language models like ChatGPT so their websites can get more visitors.


by Arooj Ahmed via Digital Information World

How Much Are Scams Really Costing You? The Numbers Are Shocking

According to the Council on Foreign Relations, many criminal groups, especially from China, have set up cyber centers in Southeast Asia where they run fraud operations and online gambling. These centers often scam people through “pig butchering,” a scam carried out through romantic relationships and crypto exchanges. The attacker approaches a victim through dating apps or WhatsApp and talks to the victim for a long time to find their emotional vulnerabilities.

During a pig butchering scam, most scammers pretend to have messaged a wrong number by accident and then start a conversation. They build up the victim’s trust and then encourage the victim to send money, a stage known as “fattening the pig”. The perpetrator claims to be a crypto expert or a wealthy man and asks the victim to transfer money to the share market as an investment. This continues until all of the victim’s money is drained, at which point the pig is, metaphorically, “slaughtered”.

Findings by Chainalysis suggest that in the past four years, victims have lost about $75.3 billion to pig butchering, and it has become more common than Ponzi schemes. Notably, the perpetrators of pig butchering are often victims themselves, trafficked and forced to carry out these scams by organized gangs, especially in regions of Myanmar.

According to the Global Anti-Scam Alliance (GASA), these scams are becoming very common, and a survey of 58,329 people in 2024 suggests that almost half of the world's population experiences a scam once a week. 84% of respondents from China said that they are more likely to recognize a scam. Scams arrive in many forms, from text/SMS messages to calls, and WhatsApp scams are now increasing sharply in different parts of the world. Overall, $1.03 trillion was lost to these scams in 2024 alone, with the US experiencing the highest financial loss ($3520 per victim). In Denmark, $3067 was lost per victim in 2024, while Slovakia lost $2738 per person. GASA also reported that 4.2% of Pakistan’s GDP had been lost to these scams, while Kenya lost 3.6% of its GDP and South Africa lost 3.4% of its GDP. Germany and France, on the other hand, each lost just 0.2% of GDP to these scams in 2024.

As these scams become increasingly sophisticated, it’s crucial to stay vigilant. If you’re ever approached by someone claiming expertise in areas like crypto, particularly through messaging apps, be cautious. Avoid sharing personal details too quickly or getting pulled into emotional appeals, especially if they swiftly move toward financial requests. Always verify the identity of individuals before trusting them with any investment or financial transaction.

Scammers Are Everywhere: Here’s How They Could Trick You Next

Country Average Loss per Victim (USD)
United States (US) $3520
Denmark (DK) $3067
Switzerland (CH) $2980
South Korea (SK) $2738
Singapore (SG) $2428
Japan (JP) $2334
United Kingdom (UK) $1818
New Zealand (NZ) $1854
Ireland (IRL) $1839
China (CHN) $1578
Malaysia (MY) $1570
Germany (GER) $1408
Thailand (TH) $1106
Russia (RUS) $678
India (IN) $384
Pakistan (PK) $287
Kenya (KEN) $209
Nigeria (NG) $119

by Arooj Ahmed via Digital Information World

Google Deletes Pledge Vowing Not To Engage In AI For Harmful Applications

Tech giant Google has removed a pledge that vowed to keep the company from using AI for dangerous applications, including surveillance and weapons.

The latest changes come under the Android maker’s AI Principles. A previous version said the company would not develop weapons or other technology whose implementation results in fatal injury or harm. It also ruled out violating users’ rights to privacy through surveillance.

A global competition for AI leadership is under way within a very complex landscape, Google shared. It went on to say that it needed to be at the forefront of AI development, guided by core values such as freedom, equality and respect for human rights.

The latest update reflects the firm’s growing ambition to offer AI technology to a wider audience, including governments. The change might also be related to the intensifying race between China and the US to see who comes out on top.

The previous version of the organization’s AI principles explained that Google would take into account a wide array of social and economic factors. The principles have now been amended to say that the benefits should outweigh the risks and downsides.

Google shared more on the matter in a blog post published on Tuesday. It said it hoped to stay consistent with widely accepted principles of international law and human rights, and that it will continue to evaluate specific work by assessing whether the benefits outweigh the risks.

The latest AI principles were reported by the Washington Post on Tuesday, right before the company’s Q4 earnings report. The results missed the revenue expectations projected by the WSJ, and shares dropped 9% during trading hours.

The AI Principles were established in 2018 after Google declined to renew its Project Maven contract with the government, a project created to interpret and analyze drone video using AI. Before the deal came to an end, thousands of employees signed petitions against the contract while others resigned over Google’s involvement. The company also dropped out of the bidding for a $10 billion Pentagon cloud contract because it was not sure the work would align with its AI principles at the time.

Ever since AI launched on a wider scale, the leadership under Pichai has worked aggressively to pursue contracts with the federal government. This has strained relations with a workforce that is very outspoken. In the last year, Google fired more than 50 workers following protests against its Project Nimbus.

Executives repeatedly said the contract did not violate any AI principles. The agreement gave Israel a range of AI tools including image categorization, tracking of objects, and provisions for weapons owned by the state. According to the NYT, Google officials raised concerns about the deal’s signing. They felt it was violating human rights.

We’ve seen the organization crack down on internal discussions of controversial subjects like the war in Gaza. The company updated guidelines for the internal forum at Memegen and it.

Image: DIW-Aigen


by Dr. Hura Anwar via Digital Information World