In Depth Building of the Federal Statistical Office in Wiesbaden

Published on April 26th, 2016 | by Daniel Castro and Paul MacDonnell

0

Germany’s Proposed Data Retention Law for Government Statistics Would Harm Economic Research

The German parliament is considering legislation that would require the government to delete certain statistical data it collects about firms after 10 years, a change that would deal a severe blow to important economic research that depends on longitudinal data. In addition to threatening to derail important research, this proposal demonstrates how poorly some policymakers understand the government’s critical role in producing high-value data sets. Not only should the German parliament reject this proposal, it should affirm the value of government efforts to produce ever more comprehensive and complete statistical data for research and analysis.

Until recently, Germany had a very positive story to tell about how it was enabling important economic research through its innovative data management practices. The German government frequently surveys firms to collect detailed data about the economy. While these surveys provide useful information on their own, the government recognized that their value would be enhanced considerably if data from multiple surveys could be brought together so that more variables could be analysed at once and these variables could be tracked over time. To achieve this, the German government uses the same identifier for each firm in every survey and has combined these data sets through a project known as AFiD (Amtliche Firmendaten für Deutschland or “official firm data for Germany”).

However, the viability of these combined data sets is at risk. On April 27, 2016, the Civil Liberties Committee of Germany’s Parliament, the Bundestag, will debate a proposal to amend the Federal Statistics Act (BStatG, Bundesstatistikgesetz) to require the Federal Statistical Office (Destatis) to delete the company identification numbers used to match data across surveys after 10 years. Deleting these records will prevent researchers from tracking firms over time, thus depriving economists and policymakers of an important tool for monitoring and understanding the changing conditions of the German economy.

The reaction from researchers has been fierce. As Theresia Bauer, Minister of Science, Research, and the Arts for the German state of Baden-Württemberg warned, imposing this restriction would jeopardize many important research projects that rely on longitudinal data about firms that spans decades. The German Economics Association—which has 4,000 members—echoed a similar concern in an open letter of protest.

This proposal is a misguided attempt to protect confidential firm data that would do more harm than good. Some policymakers mistakenly believe that limits should be placed on how much data an organization retains because less data means more privacy. Unfortunately, this type of thinking often threatens data-driven innovation. In reality, the length of time an organization retains data does not affect the security of the data, but it does impact the usability of the data. Moreover, in this particular case, the Federal Statistical Office already employs a number of safeguards to protect the confidentiality of the information it collects from firms including anonymizing the data, requiring researchers to access the data through secure research facilities, and imposing strict terms of use on all researchers.

Given the benefits of these longitudinal data sets and the existing protections in place to safeguard the confidentiality of firm data, German lawmakers should reject attempts to limit how long the Federal Statistics Office retains its merged survey data. Instead, lawmakers should be working to support efforts to improve the quality, timeliness, and completeness of government statistics not only to enable better academic research, but also to help policymakers better understand the economy. And going forward, lawmakers would be wise to listen to the concerns of scientists and researchers before proposing policies that would erase data that is vital to research.

Image: Wikipedia


About the Author

Daniel Castro is the director of the Center for Data Innovation and vice president of the Information Technology and Innovation Foundation. Mr. Castro writes and speaks on a variety of issues related to information technology and internet policy, including data, privacy, security, intellectual property, internet governance, e-government, and accessibility for people with disabilities. His work has been quoted and cited in numerous media outlets, including The Washington Post, The Wall Street Journal, NPR, USA Today, Bloomberg News, and Businessweek. In 2013, Mr. Castro was named to FedScoop’s list of “Top 25 most influential people under 40 in government and tech.” In 2015, U.S. Secretary of Commerce Penny Pritzker appointed Mr. Castro to the Commerce Data Advisory Council. Mr. Castro previously worked as an IT analyst at the Government Accountability Office (GAO) where he audited IT security and management controls at various government agencies. He contributed to GAO reports on the state of information security at a variety of federal agencies, including the Securities and Exchange Commission (SEC) and the Federal Deposit Insurance Corporation (FDIC). In addition, Mr. Castro was a Visiting Scientist at the Software Engineering Institute (SEI) in Pittsburgh, Pennsylvania where he developed virtual training simulations to provide clients with hands-on training of the latest information security tools. He has a B.S. in Foreign Service from Georgetown University and an M.S. in Information Security Technology and Management from Carnegie Mellon University.



Back to Top ↑

Show Buttons
Hide Buttons