Skip to main content
The Daily Singapore

Singapore news, every day

News

By the Numbers: How Duplicate Images Are Quietly Draining Singapore's Digital Ecosystem

New data on redundant image files reveals a hidden cost to cloud storage, bandwidth, and AI training pipelines that Singapore's tech sector can no longer afford to ignore.

Share

By Singapore News Desk · Published 5 July 2026 at 2:58 am

4 min read

Updated 3 h ago· 5 July 2026 at 11:57 am

How we reported this

This article was generated by AI from the linked public sources. The Daily Singapore is independently owned and covers Singapore news free from advertiser or sponsor influence. Read our editorial standards →

By the Numbers: How Duplicate Images Are Quietly Draining Singapore's Digital Ecosystem
Photo: Judgefloro / CC0 (Wikimedia Commons)

Singapore businesses and government agencies are sitting on hundreds of millions of duplicate image files, and the storage bill is growing. Across the city-state's enterprise sector, duplicate or near-identical images now account for an estimated 30 to 40 percent of total unstructured data in corporate cloud environments, according to figures from technology audits conducted in 2025 and cited in industry briefings circulated among members of the Singapore Computer Society earlier this year. The problem is not cosmetic. At current Amazon Web Services S3 pricing — roughly USD 0.023 per gigabyte per month for standard storage in the Asia-Pacific Singapore region — a mid-sized company hoarding 50 terabytes of redundant image data pays upward of SGD 1,600 a month for files that add zero operational value.

The timing matters. Singapore's push to position itself as a regional artificial intelligence hub — anchored by the National AI Strategy 2.0 launched in December 2023 — depends heavily on clean, well-curated datasets. Duplicate images poison training pipelines, skew model outputs, and inflate compute costs. For a country betting billions on AI readiness, bloated and redundant visual data is not a minor housekeeping issue. It is a structural liability.

What the Data Actually Shows

The scale becomes clearer at the institutional level. The Infocomm Media Development Authority, which oversees Singapore's digital infrastructure policy, has in past years pushed agencies under the Smart Nation initiative to conduct data hygiene audits. Industry practitioners familiar with those exercises — speaking in general terms at a panel held at the Suntec City Convention Centre in March 2026 — described scenarios where government-linked repositories had duplicate image rates exceeding 25 percent before deduplication tools were deployed. In one case, a statutory board managing public records had accumulated more than 2 million near-identical scanned document images due to failed batch-upload checks over a four-year period.

On the commercial side, e-commerce operators on platforms tied to Lazada and Shopee — both of which operate significant Singapore-based logistics and data infrastructure — routinely deal with product image duplication at scale. A single SKU can accumulate dozens of near-identical product photographs uploaded by different sellers or re-uploaded after minor edits. Research published by the data management firm Aparavi in 2024 found that across enterprise file systems globally, image and video files account for more than 60 percent of redundant unstructured data. Apply that ratio to Singapore's estimated 1.2 exabytes of enterprise data storage capacity — a figure cited in a 2024 IMDA sector report — and the redundancy problem reaches staggering proportions.

Duplicate image replacement, the process of identifying, consolidating, and replacing redundant visual assets with a single canonical version, has been available as a software function for years. Tools from vendors including Google Cloud Vision API and open-source libraries built on perceptual hashing algorithms can flag near-duplicate images with over 95 percent accuracy. The gap is not technological. It is procedural. Many organisations in one-north's tech cluster and in the financial district around Raffles Place still lack formal data governance policies that mandate regular deduplication cycles.

What Comes Next for Singapore Organisations

The Digital Enterprise Blueprint, which the Ministry of Communications and Information released in May 2024, explicitly calls for stronger data management practices among SMEs. Consultants working with Enterprise Singapore's Digital Growth Programme have begun incorporating image deduplication assessments into their standard technology readiness reviews, particularly for clients in retail, healthcare imaging, and media production.

For organisations that have not yet acted, the arithmetic is straightforward. A one-time deduplication exercise on a 10-terabyte image archive using commercially available tools typically costs between SGD 3,000 and SGD 8,000 in consulting and licensing fees, according to pricing sheets from vendors operating in the Mapletree Business City tech precinct. Monthly savings on cloud storage and reduced data transfer costs typically recover that investment within six months.

The practical next step is an audit. Any organisation running cloud workloads in Singapore should request a storage composition report from its provider — AWS, Google Cloud, and Microsoft Azure all offer native tools that generate such breakdowns at no additional cost. The numbers, when they come back, tend to be sobering.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Singapore

Covering news in Singapore. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Singapore news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Singapore and accept our Privacy Policy. Unsubscribe anytime.

Before you go

Get the Singapore brief

The day's Singapore news in a 2-minute read. Free, weekday mornings.

No spam. Unsubscribe anytime.