OACoder: Postcode Coding Tool
Journal of Open Research Software, 1(1), e5. DOI: 10.5334/511ba2c94d661
Abstract
Geodemographic classifications are small area classifications of social, economic and demographic characteristics. The Output Area Classification (OAC) is a free geodemographic classification. It is an Office of National Statistics validated measure that summarises neighbourhood conditions at the Output Area Level across the United Kingdom. Linkage of these valuable statistics has been problematic for users more used to address records that are georeferenced using unit postcodes. OACoder resolves this problem by allowing users to link corresponding OAC codes to each of the postcode addresses. OACoder is an open source software, and it is developed and tested to work on different versions of windows operating systems. It is stored in Figshare. The source code of the OACoder is stored in SourceForge. As open source software, OACoder has reuse potential across a range of applications. The functionality of OACoder can be extended to work with the new version of OAC (2011 OAC). It is also possible to reuse the source code and extend the functionality to work on different operating systems other than Windows. Different components of the software can be reused for the purpose of reading/writing CSV files and handling large data sets.
Extended Summary
This paper describes the development of OACoder, an open-source software tool designed to link UK postcodes with geodemographic classification codes from the Output Area Classification system. Geodemographic classifications are statistical systems that categorise small areas based on their social, economic, and demographic characteristics, operating on the principle that where people live reveals important information about their identity and circumstances. The Output Area Classification (OAC) is a free, open geodemographic system developed using UK Census data that classifies neighbourhoods into seven categories: Blue Collar Communities, City Living Areas, Countryside, Prospering Suburbs, Constrained by Circumstances, Typical Traits, and Multicultural Areas. However, whilst the OAC system uses Output Areas as its geographical foundation, most address records in practical applications use unit postcodes, creating a technical barrier for researchers and practitioners wanting to apply geodemographic analysis to their data. The software was developed using Java programming language and an object-oriented design approach, making it platform-independent and modular. OACoder utilises the National Statistics Postcode Directory (ONSPD) from the Office for National Statistics, which contains OAC codes for each unit postcode in the UK. The tool reads CSV files containing lists of postcodes and automatically appends the corresponding geodemographic classification codes, handling datasets efficiently even when processing against the complete UK postcode database of 1.8 million records. Testing was conducted across multiple Windows operating systems, including Windows NT, 2000, XP, Vista, 7, and Server 2008, with minimum system requirements of 4GB memory and preferably a dual-core processor. The research demonstrates significant reuse potential for the software’s modular components. Individual modules can be adapted for reading and writing large CSV files, handling substantial datasets efficiently, or extended to work with updated classification systems such as the anticipated 2011 OAC based on more recent census data. The object-oriented architecture also enables adaptation for other operating systems beyond Windows, potentially expanding accessibility for Mac and Linux users in research environments. This development addresses a practical need in applied geography, market research, public health, and social policy analysis where postcode-based data requires geodemographic contextualisation. By providing an open-source solution with transparent methodology, the research contributes to more accessible and reproducible spatial analysis, contrasting with proprietary ‘black box’ commercial geodemographic systems that lack documentation of their classification methods.
Key Findings
- OACoder successfully bridges the gap between postcode-referenced data and Output Area Classification geodemographic codes through automated CSV processing.
- The software efficiently handles large datasets, matching user postcodes against 1.8 million UK postcodes using modular Java-based architecture.
- Open-source design enables component reuse for CSV file handling, large data processing, and extension to different operating systems.
- Modular architecture facilitates future adaptation for updated classification systems, including the anticipated 2011 Output Area Classification.
- The tool provides transparent, accessible alternative to proprietary geodemographic systems whilst maintaining professional-grade processing capabilities.
Citation
@article{unknown2013oacoder,
author = {},
title = {OACoder: Postcode Coding Tool},
journal = {Journal of Open Research Software},
year = {2013},
volume = {1(1)},
pages = {e5},
doi = {10.5334/511ba2c94d661}
}