03 Jun '11, 9am
The Challenge of Aggregating Sustainability Data Sets
Matching up these ten data sets was difficult. One problem is that each firm has its own approach to recording, displaying, and abbreviating the names of the companies it follows. As a result, we had to store name variants for each SRI firm and research whether Company A at one firm is really the same as Company B at another. A more substantive problem is that each firm uses a different “schema” for organizing its data and different definitions for the behavior it measures. One source may say that a company’s board is diverse, another that 10% of the company’s directors are women, and a third that the company has a board diversity policy. We had to correlate and harmonize all of these different measures.
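To make the two problems concrete, here is a minimal Python sketch of one way such matching could be organized. It is an illustration under stated assumptions, not a description of the actual system: the firm labels (`firm_a`, `firm_b`, `firm_c`), the company "Acme Widgets", the field names, and the helper functions are all hypothetical. It keeps a per-firm table of name variants that resolves to one canonical company ID, and a per-firm field map that places each firm's measure in a common schema without pretending the three measures mean the same thing.

```python
import re

# Hypothetical alias table: (source firm, normalized name variant) -> canonical ID.
# Variants that normalization cannot reconcile (e.g. abbreviations) are stored explicitly.
ALIASES = {
    ("firm_a", "acme widgets"): "ACME",
    ("firm_b", "acme wdgt"): "ACME",
    ("firm_c", "acme widgets"): "ACME",
}

# Hypothetical per-firm schema map: the firm's own field name -> common field name.
FIELD_MAP = {
    "firm_a": {"diverse_board": "board_is_diverse"},
    "firm_b": {"pct_female_directors": "pct_women_directors"},
    "firm_c": {"diversity_policy": "has_board_diversity_policy"},
}


def normalize(name):
    """Lowercase, drop punctuation, and strip common corporate suffixes."""
    name = re.sub(r"[^\w\s]", "", name.lower())
    name = re.sub(r"\b(corp|corporation|inc|co|ltd)\b", "", name)
    return " ".join(name.split())


def canonical_id(source, raw_name):
    """Return the canonical company ID, or None if the variant is unknown."""
    return ALIASES.get((source, normalize(raw_name)))


def merge(observations):
    """Combine (source, raw_name, field, value) rows into one record per company.

    Rows whose company name cannot be resolved are returned separately,
    since they need manual research before they can be matched.
    """
    merged, unmatched = {}, []
    for source, raw_name, field, value in observations:
        cid = canonical_id(source, raw_name)
        if cid is None:
            unmatched.append((source, raw_name))
            continue
        common_field = FIELD_MAP[source][field]
        merged.setdefault(cid, {})[common_field] = value
    return merged, unmatched


rows = [
    ("firm_a", "Acme Widgets Corp.", "diverse_board", True),
    ("firm_b", "ACME WDGT", "pct_female_directors", 10.0),
    ("firm_c", "Acme Widgets, Inc.", "diversity_policy", True),
    ("firm_b", "Unknown Holdings", "pct_female_directors", 25.0),
]
merged, unmatched = merge(rows)
```

In this sketch the three diversity measures stay in separate fields of the combined record. Whether and how to collapse them into a single score is a separate judgment call that the code does not attempt.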