GDD’s Fundamental Data Philosophies
With many years of experience in many environments and countries, GDD has developed and evolved our own data management philosophy. This consists of a combination of some basic high-level philosophies, underpinned by our very specific technical data management standards and procedures.

Our Data Philosophies
These core data management philosophies guide strongly the design and implementation of the data environments we build, and the database applications and utilities that GDD have developed.
They seek to address the following questions relevant to the technical data universe! In summary these include –
The TWO Master Technical Data Sets
‘Begin with the end in mind…’
There are several key requirements for your master technical data if it is to provide the best return on investment made in collecting it. Every project, process or discipline that relies on the analysis of specific technical data should be maintaining two master data sets.
-
- The Master Working Technical Data Set – The ‘single point of truth’; the assembled, standardised, integrated, validated data sets you build for your project analysis and decision-making requirements
- The Master Source/Original Data Set – The original as-collected, inherited, received, scrounged or pilfered data in its complete original untouched form. These components will often exist in two places; as original as-received system files, and within the database, with links to the working data set and the original system files
The Master WORKING Technical Data Set
The master working technical data sets, the assembled, integrated and standardised data ultimately used for analysis and decision making, should be –
-
- ‘The data, the whole data and nothing but the data’
- Instantiated – The master data must actually exist / be visible or ‘observable’; it should not rely on arcane queries or processes to extract it or make it visible
- Common Data Type Instantiation – All standard or commonly expected observation or measurement data types should have a single, fixed and unambiguous location.
- Logical, Clear and Consistent Data Structure – Clear, transparent and unambiguous data structure (schema) – with understandable, logical and consistent object naming conventions
- Contained Data – must be complete, integrated and standardised
-
- Contain only the ‘accepted’ (highest priority, ‘golden’) data value for any item
- Data Integrity – Must be accurate and regularly validated
- Accessible – No matter who assembles or manages your data, you should maintain an accessible current local copy at all times
- Secure and Quarantined – The data set should be segregated from any intermediate, working, QAQC or additional measurement (lower priority) data that should not be included be included in the master working data set
-
- Should also be protected from inadvertent MCU’s or malicious damage. E.g. set to read-only once validated
- Auditable – The data should be linked to its source / original data records
The Master SOURCE Technical Data Set
The master source / original technical data collection should likewise be –
-
- Complete and Unaltered – The source data should be retained exactly as collected or received (following validation)
- Accessible
- Secure and Quarantined
- Auditable – With provenance data / links to identify its origin
Data Analysis is DATA Analysis!
-
- Understand that all data analysis processes and applications, including the newly emerging technologies, rely on a complete, sound underlying master technical data set. …Which is why it’s called ‘data analysis’!
Understand Your Technical Data’s Value And Importance
-
- Understand the full and fundamental importance and significant value of your technical data
Maintain That ‘Single Point Of Truth’
-
- Ensure that the master or ‘gospel’ copy of each dataset type exists in one place only
The Technical Data Life Cycle
There are three distinct activity phases in the data life cycle –
-
- Collect – Getting the data in the first place, identifying and storing it
- Manage – Validation and integrity checking, accessibility and security, standardisation and assembly of the master working data sets
- Analyse – Using the assembled data; analysis, evaluation, extraction and reporting; decision making
Link and Use Of ALL Your DataUnderstand what master technical data consists of; these days; it’s much more than the letters and numbers! What aspects need to be considered to assemble all of the relevant data types and attributes useful to the project? GDD recognise and use four data-use classes –
|
|
Link and Use Of ALL Your Data
Understand what master technical data consists of; these days; it’s much more than the letters and numbers!
What aspects need to be considered to assemble all of the relevant data types and attributes useful to the project?
GDD recognise and use four data-use classes –
-
- Primary Data – The conventional alpha-numeric data
- Secondary Data – generally multi-media data and information types related to specific primary data records (e.g. core tray and other photos, videos satellite and aerial images, field drawings and maps etc.)
-
- Linked to and retrievable from the related primary data record
- Tertiary Data – files / data of any type that may not add information, but serves to verify the data in the primary and secondary data sets. (E.g. survey or lab assay certificates, historical report PDF’s)
-
- Again with links to their relevant primary and secondary data records for instant auditability
- Metadata – Data about the data records as stored; date created / modified, source data and original file details, data and validation status, data change-control stuff etc.
Structure Your Master Data Environment
-
- Know how to structure and implement your master technical data / database
- Understand and provide the data management characteristics and attributes, as dictated by the type of data being assembled
- Understand the data collection, management and analysis processes that are required
Segregate Your Master Technical Data
-
- Keep your master technical data and database separate from and independent of all application and work folders and files
Make Data Access Simple And Intuitive
|
|
Data Validation And Integrity Checking
|
|
Standards And Procedures
|
|
Capture And Retain Data Provenance
|
|
New Technologies
-
- AI, Master Technical Data, and The Need For Some RI – AI’s have significant and valuable potential, but also associated risks. They (nearly) all require an underlying platform of sound ‘foundational’ data. There is a real need to apply some serious RI as we start to make use of them








