Data format guidelines

Over the course of the Climate Watch project, you may need/wish to update the sites data. Please provide your data updates according the following formatting guidelines.

Please provide all files in comma separated value (.csv) format with utf-8 encoding. This will avoid any issues with special characters. If you have your data in an Excel table and are unsure how to produce a csv file, you can export a single sheet (the active sheet) by clicking Save As in the drop-down File menu, and selecting csv as a file-type.

Unless you have a specific reason not to do so, you should leave any missing data fields blank. If there is currently filler data fields, such as NA, N/A, None, nil, -9999, etc, please remove these values and leave the entries blank (unless you have a good reason not to do so, and it is clear from the metadata that these missing data are present).

Please ensure columns are included with the column names written in an identical manner and that the contents conform to the below data types. Entries that must be present (i.e. fields that cannot be left blank) have their column names highlighted in bold.

Please avoid using hidden columns as you work on the file as a spreadsheet, because those will end up as visible in the CSV file and in case of column name clashes wrong data may be used unexpectedly. Any columns that are not needed should be deleted. Please also do not use highlighting on the file to denote special treatment of particular rows, as this information is lost when saving as a csv and is better encoded in the file by using a separate column.

Please as much as possible aim to design files in a horizontal manner, as csv files are normally read row by row and not in columns.

NDC-SDG Linkages

For this data to be imported correctly some SDG metadata is needed first.

sdgs.csv

Column Name	Data Type	Example
number	String	SDG number e.g. 1
title	String	full goal title (might be redundant if we only use short names), e.g. End poverty in all its forms everywhere
cw_title	String	goal title as displayed in Climate Watch, e.g. No poverty

sdg_targets.csv

ndc_sdg_targets.csv

NDC Summary Content

The NDC summary data are spread across several sources: the World bank WB, CAIT, and DIE. These tables are the data source for NDC overview/comparison tables and also choropleths. The choropleths will use identical data to DIE’s klimalog.

DIE data

DIE Kimalog’s site currently downloads their entire dataset to users (~300kb) in a csv file. We can use an identical method and their identical data: http://klimalog.die-gdi.de/ndc/assets/map_data/Explorer-NDC-Data.csv These data will also be used in our summary tables. The dataset generally has two columns for every piece of data, a VIS for display as a Choropleth - these are integers indicating the category, and a TXT column containing the attributes to display on hover or in a table (the strings that should act as attributes/labels). The DIE data are divided into 6 categories ( cc_mitigation, cc_adaptation, finance_and_support, planning_process, broader_picture ), the groupings and a sentence description are given in the file NDC content_data0424.xlsx, sheet DIE indicator.

The Explorer-NDC-Data.csv file (DIE data) contains the following columns:

'countryName',
'countryCode',
'1. typeOfTargetVIS',
'1. typeOfTargetTXT',
'2. costsOfCCMVIS',
'2. costsOfCCMTXT',
'3. renewableEnergyVIS',
'3. renewableenergyTXT',
'4. energyEffVIS',
'4. energyEffTXT',
'5. transportVIS',
'5. transportTXT',
'6. carbonCaptStorVIS',
'6. carbonCaptStorTXT',
'7. agricultureVIS',
'7. agricultureTXT',
'8. landUseChangeVIS',
'8. landUseChangeTXT',
'9. wasteVIS',
'9. wasteTXT',
'10. redNonCO2GasesVIS',
'10. redNonCO2GasesTXT',
'11. fossilFuelSubsidiaryVIS',
'11. fossilFuelSubsidiaryTXT',
'12. landusechangeVIS',
'12. landusechangeTXT',
'13. reddVIS',
'13. reddTXT',
'14. tradeVIS',
'14. tradeTXT',
'15. marketMechanismsVIS',
'15. marketMechanismsTXT',
'16. mitigationDocumentsVIS',
'16. mitigationDocumentsTXT',
'17. coBenefitsCCMVIS',
'17. coBenefitsCCMTXT',
'18. migrationDisplacementVIS',
'18. migrationDisplacementTXT',
'19. vulnerabilityAgricultureVIS',
'19. vulnerabilityAgricultureTXT',
'20. vulnerabilityWaterVIS',
'20. vulnerabilityWaterTXT',
'21. vulnerabilityEcosystemsVIS',
'21. vulnerabilityEcosystemsTXT',
'22. vulnerabilityHealthVIS',
'22. vulnerabilityHealthTXT',
'23. vulnerabilityCoastalZonesVIS',
'23. vulnerabilityCoastalZonesTXT',
'24. vulnerabilityCostsOfCRHVIS',
'24. vulnerabilityCostsOfCRHTXT',
'25. futureCostsVIS',
'25. futureCostsTXT',
'26. clRisksRxtremeWeatherVIS',
'26. clRisksExtremeWeatherTXT',
'27. clRisksFloodsVIS',
'27. clRisksFloodsTXT',
'28. clRisksDroughtsVIS',
'28. clRisksDroughtsTXT',
'29. clRisksTempIncreaseVIS',
'29. clRisksTempIncreaseTXT',
'30. clRisksSeaLevelRiseVIS',
'30. clRisksSeaLevelRiseTXT',
'31. prioritySectorsWaterVIS',
'31. prioritySectorsWaterTXT',
'32. prioritySectorsAgricultureVIS',
'32. prioritySectorsAgricultureTXT',
'33. prioritySectorsHealthVIS',
'33. prioritySectorsHealthTXT',
'34. prioritySectorsEcosystemsVIS',
'34. prioritySectorsEcosystemsTXT',
'35. prioritySectorsForestryVIS',
'35. prioritySectorsForestryTXT',
'36. quantifiedAdaptationTargetVIS',
'36. quantifiedAdaptationTargetTXT',
'37. costsOfAdaptationVIS',
'37. costsOfAdaptationTXT',
'38. adaptationDocumentsVIS',
'38. adaptationDocumentsTXT',
'39. coBenefitsCCAVIS',
'39. coBenefitsCCATXT',
'40. adaptationFinanceConditionalityVIS',
'40. adaptationFinanceConditionalityTXT',
'41. mitigationFinanceConditionalityVIS',
'41. mitigationFinanceConditionalityTXT',
'42. technNeedsVIS',
'42. techNeedsTXT',
'43. techTransferVIS',
'43. techTransferTXT',
'44. capacityBuildingVIS',
'44. capacityBuildingTXT',
'45. moiFairnessVIS',
'45. moiFairnessTXT',
'46. planningOfINDCFormulationVIS',
'46. planningOfINDCFormulationTXT',
'47. stakeholderConsFormulationVIS',
'47. stakeholderConsFormulationTXT',
'48. planningOfINDCImplementationVIS',
'48. planningOfINDCImplementationTXT',
'49. assessmentAndReviewVIS',
'49. assessmentAndReviewTXT',
'50. conjunctionWithSDGsVIS',
'50. conjunctionWithSDGsTXT',
'51. conjunctionWithNatSustDevtVIS',
'51. conjunctionWithNatSustDevtTXT',
'52. fossilFuelSubsReformVIS',
'52. fossilFuelSubsReformTXT',
'53. greenEconomyVIS',
'53. greenEconomyTXT',
'54. disasterRiskReductionVIS',
'54. disasterRiskReductionTXT',
'55. sectionOnFairnessVIS',
'55. sectionOnFairnessTXT',
'56. historicalResponsibVIS',
'56. historicalResponsibTXT',
'57. lossAndDamageVIS',
'57. lossAndDamageTXT',
'58. genderVIS',
'58. genderTXT',
'59. humanRightsVIS',
'59. humanRightsTXT',
'byIncomeVIS',
'byIncomeTXT',
'byRegionVIS',
'byRegionTXT',
'g20VIS',
'g20TXT',

CAIT data

Currently these data seems to be in NDC content data 0424.xlsx. Looks like they need cleaning. We will ask WRI to clean these data and move these into a csv format, as testing indicates the file will be small. E.g. currently there are values in percent with % symbol attached, html instead of unicode text, dates some as just date others with time attached, column names should be snake case, we should also have metadata dictionary mapping the column name to description. “” used to encapsulate any string with a comma inside. Blanks permitted.

NDC Files

Collection of .md files, stored in a S3 bucket.

Historical Climate data

Data sources

Sectors

Please note: rows which define subsectors should follow rows which defined their parents.

Countries

Countries and country groups for which data is collected are referenced by a code, which is the 3-digit iso codes in case of countries and a custom code in case of country groups.

column name	data type
iso_code3	String, 3-digit iso code for countries and custom code for country groups, e.g. ‘USA’
iso_code2	String, 2-digit iso code for countries and custom code for country groups, e.g. ‘USA’
wri_standard_name	String, name as displayed in the system, e.g. ‘United States’
location_type	String, COUNTRY, REGION or GROUP
show_in_cw	String, ‘no’ if location not to be displayed
pik_name	String, name as used in PIK dataset
cait_name	String, name as used in CAIT dataset
ndcp_navigators_name	String, name as used in NDCP Navigators
unfccc_group	String, e.g. UNFCCC Non-Annex I

Country groups and regions

column name	data type
parent_iso_code3	String, 3-digit iso code for country groups, e.g. ‘EUU’
name	String, name of country who is a member of a group, e.g. ‘Poland’ (not required)
iso_code3	String, 3-digit iso code for country who is member of a group, e.g. ‘POL’

Emissions

column name	data type
country	String matching the `code` attribute of countries
source	String matching the `name` property of sources
sector	String matching the `name` attribute of sectors
gas	String matching the `gas` property of sectors
gwp	String, e.g. ‘AR2’
year	Integer, e.g. 2000
value	Float, e.g. 5.8