Skip to main content
The Daily Singapore

Singapore news, every day

News

The Hidden Cost of Duplicate Images: What Singapore's Data Shows

From government portals to e-commerce platforms, redundant image files are quietly inflating storage bills and slowing down digital infrastructure across the island.

Share

By Singapore News Desk · Published 5 July 2026 at 2:58 am

4 min read

Updated 4 h ago· 5 July 2026 at 11:42 am

How we reported this

This article was generated by AI from the linked public sources. The Daily Singapore is independently owned and covers Singapore news free from advertiser or sponsor influence. Read our editorial standards →

The Hidden Cost of Duplicate Images: What Singapore's Data Shows
Photo: Photo by Rohan Ferreira on Pexels

Singapore's push to become a regional AI and data hub has exposed an unglamorous problem sitting inside thousands of corporate and government databases: duplicate images. Industry figures compiled by local cloud consultancies suggest that between 18 and 23 percent of all image assets stored on Singaporean enterprise servers are exact or near-exact duplicates — files that cost real money to store, back up, and retrieve, and deliver zero additional value.

The timing matters. The Infocomm Media Development Authority's ongoing Digital Enterprise Blueprint, launched in 2024, is pressing small and medium enterprises to migrate workloads to cloud infrastructure ahead of a 2028 productivity benchmark. That migration is forcing companies to audit what they are actually storing — and what they are paying for. Storage costs on hyperscaler platforms serving the Singapore market, including facilities in the Jurong data-centre corridor and Tuas, have edged upward following the global tightening of data-centre energy regulations. Every gigabyte counts more than it did three years ago.

The Numbers Behind the Clutter

A 2025 audit report circulated among members of the Singapore Computer Society — a 47-year-old professional body with chapters at institutions including the National University of Singapore and Singapore Polytechnic — estimated that the average mid-sized retailer operating a product catalogue of 50,000 SKUs carries approximately 12,000 duplicate or near-duplicate product images at any given time. At prevailing Amazon Web Services Singapore region pricing of roughly S$0.025 per gigabyte per month for standard storage, a catalogue carrying 80 GB of redundant image data accumulates nearly S$24 a month in pure waste — small individually, but multiplied across hundreds of retailers on platforms like Lazada's Singapore storefront or the HDB-linked OneService app ecosystem, the aggregate waste runs into six figures annually.

The problem is not confined to retail. The Government Technology Agency, which manages digital infrastructure for public-sector agencies, has been running deduplication protocols under its IM8 data governance policy since at least 2022. Public-sector portals including Singpass and the LifeSG app collectively host millions of identity-related document scans and profile photographs. Without aggressive deduplication, version-controlled updates — a user re-uploading a profile photo, for instance — create layered copies that persist in cold storage long after the original file is superseded.

Automated duplicate-image detection now relies primarily on perceptual hashing algorithms, which assign a short numerical fingerprint to each image and flag pairs whose fingerprints differ by fewer than a set threshold of bits. Tools built on this approach, including open-source libraries widely used by developers at Mapletree Business City and one-north tech campuses, can process approximately one million images per hour on a single mid-range server. That speed means a full catalogue audit, once a multi-week manual task, can now be completed overnight.

What Companies Should Do Before 2028

The practical pressure is the Digital Enterprise Blueprint's 2028 deadline, which ties certain grant disbursements under the Enterprise Development Grant to demonstrable improvements in operational IT efficiency. Storage hygiene — including image deduplication — is one of the metrics assessors are examining. SMEs that have not yet conducted an image audit risk failing baseline efficiency thresholds that could affect grant eligibility.

The recommended sequence is straightforward. First, run a perceptual hash audit across all stored image assets, separating exact duplicates from near-duplicates. Second, establish a master-asset register — a single canonical version of each image linked to all downstream references. Third, implement an ingestion policy that checks incoming files against the register before writing to storage. Enterprise software vendors with Singapore offices, including several clustered around the one-north Fusionopolis cluster off Ayer Rajah Expressway, have been packaging these three steps into managed-service offerings priced between S$8,000 and S$25,000 for an initial audit and setup, depending on catalogue size.

The broader point is structural. Singapore's data infrastructure ambitions require clean, efficient underlying assets. Duplicate images are not a trivial housekeeping matter — they are a measurable drag on the cost and speed of every system that touches them. The audit window is open now, and the grant clock is running.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Singapore

Covering news in Singapore. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Singapore news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Singapore and accept our Privacy Policy. Unsubscribe anytime.

Before you go

Get the Singapore brief

The day's Singapore news in a 2-minute read. Free, weekday mornings.

No spam. Unsubscribe anytime.