Singapore's public and private sectors are sitting on tens of millions of duplicate image files, and the bill for storing them is growing faster than most organisations want to admit. A review of digital asset management practices across several local industries shows that duplicate images — identical or near-identical files saved multiple times across different folders, servers and cloud buckets — now account for an estimated 30 to 40 percent of total image storage in large enterprise environments. That figure, cited by technology consultancies operating out of the one-north business park in Queenstown, is not a rounding error. It translates directly into wasted expenditure at a time when cloud storage costs in Southeast Asia remain elevated.
The timing matters. Singapore's Smart Nation and Digital Government Office has been pushing agencies to accelerate digitisation of physical records since 2021, and the volume of scanned documents, photographs and visual assets held by public bodies has grown sharply. The National Archives of Singapore, based on Canning Rise near Fort Canning Park, has digitised more than 10 million records over the past decade. When files are ingested quickly, duplicates multiply. A photograph scanned once and then re-uploaded during a system migration, a staff reshuffle or a departmental merger does not announce itself — it simply takes up space and costs money.
What the Storage Bills Actually Look Like
Cloud storage is not free. Amazon Web Services S3 standard storage in the Asia Pacific (Singapore) region is priced at approximately USD 0.025 per gigabyte per month. A single high-resolution scanned document image runs between 2 and 5 megabytes. Multiply that across an organisation holding 500,000 images — a modest figure for a medium-sized government statutory board — and the duplicate overhead alone can cost tens of thousands of dollars annually, before bandwidth and retrieval fees are counted.
At the enterprise end, the numbers are starker. Technology firms at the Mapletree Business City complex in Alexandra have reported internal audits finding duplication rates above 35 percent in their marketing asset libraries. One audit, conducted in the first quarter of 2026, found a single product photograph saved under 47 different filenames across six cloud storage buckets. The image itself was 8.3 megabytes. That one file, duplicated across systems, was costing the organisation an estimated SGD 180 per year in pure storage fees — a trivial sum on its own, but representative of a pattern replicated thousands of times across a single company's archive.
The Infocomm Media Development Authority has flagged data hygiene as part of its Digital Enterprise Blueprint, which was updated in 2024. The blueprint encourages companies to adopt automated deduplication tools before migrating to new cloud environments, but adoption among small and medium enterprises remains uneven. Many SMEs based in areas like Toa Payoh and Jurong East lack dedicated IT staff to run periodic deduplication audits.
The Fix Is Algorithmic, But Adoption Is Slow
Deduplication software uses perceptual hashing — a technique that generates a short numeric fingerprint for each image — to flag files that are visually identical even if they carry different filenames, resolutions or metadata. Tools capable of scanning 1 million images in under two hours are commercially available and widely used by larger media organisations. The Mediacorp campus at Andrew Road has deployed image deduplication as part of its digital asset management workflow since at least 2023, according to publicly available case materials shared at a GovTech-organised conference that year.
The practical advice for organisations yet to act is straightforward. Run a deduplication audit before the next storage contract renewal. Most cloud providers, including Google Cloud Platform and Microsoft Azure — both of which maintain Singapore data centres — offer native deduplication reporting tools within their storage dashboards. For organisations on legacy on-premise servers, open-source tools such as dupeGuru can process image libraries without specialist staff. The National Library Board's digital preservation guidelines, last updated in March 2025, also recommend a minimum annual audit cycle for all digitised collections. Waiting until a system migration to discover the problem typically costs three to five times more to fix than catching it during routine maintenance.