<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>PSI | Economy Data Observatory</title>
    <link>http://example.org/tag/psi/</link>
      <atom:link href="http://example.org/tag/psi/index.xml" rel="self" type="application/rss+xml" />
    <description>PSI</description>
    <generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><copyright>© 2021 Daniel Antal</copyright><lastBuildDate>Sun, 16 May 2021 00:00:00 +0000</lastBuildDate>
    <image>
      <url>http://example.org/media/icon_hu455accb9bf07a92e92cfc4e482b41cba_20655_512x512_fill_lanczos_center_2.png</url>
      <title>PSI</title>
      <link>http://example.org/tag/psi/</link>
    </image>
    
    <item>
      <title>Open Data</title>
      <link>http://example.org/data/open-gov/</link>
      <pubDate>Sun, 16 May 2021 00:00:00 +0000</pubDate>
      <guid>http://example.org/data/open-gov/</guid>
      <description>&lt;p&gt;Many countries in the world allow access to a vast array of information,
such as documents under freedom of information requests, statistics,
datasets. In the European Union, most taxpayer financed data in
government administration, transport, or meteorology, for example, can
be usually re-used. More and more scientific output is expected to be
reviewable and reproducible, which implies open access.&lt;/p&gt;
&lt;table&gt;
&lt;tbody&gt;
&lt;tr class=&#34;odd&#34;&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;













&lt;figure  id=&#34;figure-whats-the-problem-with-open-datadataopen-govopen-data-problems&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;http://example.org/media/img/blogposts_2021/photo-1490004047268-5259045aa2b4.jpg&#34; alt=&#34;[What’s the Problem with Open Data?](/data/open-gov/#open-data-problems)&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;http://example.org/data/open-gov/#open-data-problems&#34;&gt;What’s the Problem with Open Data?&lt;/a&gt;
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;













&lt;figure  id=&#34;figure-how-we-add-valuedataopen-govopen-data-value-added&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;http://example.org/media/img/blogposts_2021/photo-1590247813693-5541d1c609fd.jpg&#34; alt=&#34;[How We Add Value?](/data/open-gov/#open-data-value-added)&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;http://example.org/data/open-gov/#open-data-value-added&#34;&gt;How We Add Value?&lt;/a&gt;
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;table&gt;
&lt;tbody&gt;
&lt;tr class=&#34;even&#34;&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;













&lt;figure  id=&#34;figure-is-there-value-in-itdataopen-govis-there-value-left-in-open-data-if-its-money-on-the-street-why-nobodys-picking-it-up&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;http://example.org/media/img/blogposts_2021/photo-1533580909002-a2f298d005eb.jpg&#34; alt=&#34;[Is There Value in It?](/data/open-gov/#is-there-value-left-in-open-data) If it’s money on the street, why nobody’s picking it up?&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;http://example.org/data/open-gov/#is-there-value-left-in-open-data&#34;&gt;Is There Value in It?&lt;/a&gt; &lt;/br&gt;If it’s money on the street, why nobody’s picking it up?
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;













&lt;figure  id=&#34;figure-datasets-should-work-together-to-give-informationdataopen-govdata-integrationdata-is-only-potential-information-raw-and-unprocessed&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;http://example.org/media/img/blogposts_2021/photo-1605143185650-77944b152643.jpg&#34; alt=&#34;[Datasets Should Work Together to Give Information](/data/open-gov/#data-integration)Data is only potential information, raw and unprocessed.&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;http://example.org/data/open-gov/#data-integration&#34;&gt;Datasets Should Work Together to Give Information&lt;/a&gt;&lt;/br&gt;Data is only potential information, raw and unprocessed.
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id=&#34;open-data-problems&#34;&gt;What’s the Problem with Open Data?&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;“Data is stuff. It is raw, unprocessed, possibly even untouched by human
hands, unviewed by human eyes, un-thought-about by human minds.”&lt;/em&gt; [1]&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Most open data cannot be just &lt;a href=&#34;#open-data-faq&#34;&gt;&amp;ldquo;downloaded.&amp;quot;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Often, you need to put more than $100 value of &lt;a href=&#34;#is-there-value-left-in-open-data&#34;&gt;work&lt;/a&gt; into processing, validating, documenting a dataset that is worth $100. But you can share this investment with our data observatories.&lt;/li&gt;
&lt;li&gt;Open data is almost always lacking of documentation, and no clear references to validate if the data is reliable or not corrupted. This is why we always &lt;a href=&#34;#open-data-value-added&#34;&gt;start&lt;/a&gt; with reprocessing and redocumenting.&lt;/li&gt;
&lt;/ul&gt;














&lt;figure  id=&#34;figure-our-review-of-about-80-eu-un-and-oecd-data-observatories-reveals-that-most-of-them-do-not-use-these-organizationss-open-data---instead-they-use-various-and-often-not-well-processed-proprietary-sources&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;http://example.org/media/img/observatory_screenshots/observatory_collage_16x9_800.png&#34; alt=&#34;Our review of about 80 EU, UN and OECD data observatories reveals that most of them do not use these organizations&amp;#39;s open data - instead they use various, and often not well processed proprietary sources.&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      Our review of about 80 EU, UN and OECD data observatories reveals that most of them do not use these organizations&amp;rsquo;s open data - instead they use various, and often not well processed proprietary sources.
    &lt;/figcaption&gt;&lt;/figure&gt;
&lt;p&gt;Read more: &lt;a href=&#34;https://dataandlyrics.com/post/2021-06-18-gold-without-rush/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Open Data - The New Gold Without the
Rush&lt;/a&gt;&lt;/p&gt;
&lt;h2 id=&#34;open-data-value-added&#34;&gt;How We Add Value?&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;We believe that even such generally trusted data sources as Eurostat
often need to be reprocessed, because various legal and political
constraints do not allow the common European statistical services to
provide optimal quality data – for example, on the regional and city
levels.&lt;/li&gt;
&lt;li&gt;With
&lt;a href=&#34;https://greendeal.dataobservatory.eu/authors/ropengov/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;rOpenGov&lt;/a&gt;
and other partners, we are creating open-source statistical software
in R to re-process these heterogenous and low-quality data into tidy
statistical indicators to automatically validate and document it.&lt;/li&gt;
&lt;li&gt;Metadata is a potentially informative data record about a
potentially informative dataset. We are carefully documenting and
releasing administrative, processing, and descriptive metadata,
following international metadata standards, to make our data easy to
find and easy to use for data analysts.&lt;/li&gt;
&lt;li&gt;We are automatically creating depositions and authoritative copies
marked with an individual digital object identifier (DOI) to
maintain data integrity.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;is-there-value-left-in-open-data&#34;&gt;Is There Value in Open Data?&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;A well-known story tells of a finance professor and a student who come across a $100 bill lying on the ground. As the student stops to pick it up, the professor says, “Don’t bother—if it were really a $100 bill, it wouldn’t be there.”&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;But this is not the case with open data.  Often, you need to put more than $100 into processing, validating, documenting a dataset that is worth $100.&lt;/p&gt;
&lt;p&gt;In the EU, open data is governed by the &lt;a href=&#34;https://eur-lex.europa.eu/legal-content/EN/TXT/?qid=1561563110433&amp;amp;uri=CELEX:32019L1024&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Directive on open data and the re-use of public sector information - in short: Open Data Directive (EU) 2019 / 1024&lt;/a&gt;. It entered into force on 16 July 2019. It replaces the &lt;a href=&#34;https://eur-lex.europa.eu/legal-content/en/ALL/?uri=CELEX:32003L0098&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Public Sector Information Directive&lt;/a&gt;, also known as the &lt;em&gt;PSI Directive&lt;/em&gt; which dated from 2003 and was subsequently amended in 2013.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Open Data&lt;/strong&gt; is &lt;em&gt;potentially&lt;/em&gt; useful data that can &lt;em&gt;potentially&lt;/em&gt; replace costlier or hard to get data sources to build information. It is analogous to potential energy: work is required to release it. We build automated systems that reduce this work and increase the likelihood that open data will offer the &lt;em&gt;best value for money&lt;/em&gt;.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Most open data is not publicy accessible, and available upon request. Our real curatorial advantage is that we know where it is and how to get this request processed.&lt;/li&gt;
&lt;li&gt;Most European open data comes from tax authorities, meteorological
offices, managers of transport infrastructure, and other
governmental bodies whose data needs are very different from yours.
Their data must be carefully evaluated, re-processed, and if
necessary, imputed to be usable for your scientific, business or
policy goals.&lt;/li&gt;
&lt;li&gt;The use of open science data is problematic in different ways:
usually understanding the data documentation requires
domain-specific specialist knowledge. &lt;a href=&#34;http://example.org/data/open-science/&#34;&gt;Open science
data&lt;/a&gt; is even more scattered and difficult to
access than technically open, but not public governmental data.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;data-integration&#34;&gt;From Datasets to Data Integration, Data to Information&lt;/h2&gt;
&lt;p&gt;“Data is only potential information, raw and unprocessed, prior to
anyone actually being informed by it.” ^[2]&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;We are building simple databases and supporting APIs that release
the data without restrictions, in a tidy format that is easy to join
with other data, or easy to join into databases, together with
standardized metadata.&lt;/li&gt;
&lt;/ul&gt;














&lt;figure  id=&#34;figure-our-service-flow-and-value-chain&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;http://example.org/media/img/slides/automated_observatory_value_chain.jpg&#34; alt=&#34;Our service flow and value chain&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      Our service flow and value chain
    &lt;/figcaption&gt;&lt;/figure&gt;
&lt;h2 id=&#34;open-data-faq&#34;&gt;FAQ&lt;/h2&gt;
&lt;h3 id=&#34;why-downloading-does-not-work&#34;&gt;Why Downloading Does Not Work?&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Most open data is not available on the internet.&lt;/li&gt;
&lt;li&gt;If it is available, it is not in a form that you can easily import into a spreadsheet application like Excel or OpenOffice, or into a statistical application like SPSS or STATA.&lt;/li&gt;
&lt;li&gt;Even the data quality of trusted web sources, like the Eurostat website, can be very low. Eurostat just publishes what it gets from governments, and often has no mandate to fix errors.  The data is full with missing information, and in the case of regional statistics, faulty region codes and region names that make matching your data or placing them on a map impossible.&lt;/li&gt;
&lt;li&gt;Adjusting euros with millions of euros, correctly translating dollars to euros, pounds to kilograms requires plenty of work. This is a very error-prone process when done by humans.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;can-open-data-be-used-in-machine-learning-and-ai&#34;&gt;Can Open Data be Used in Machine Learning and AI?&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Most public and open data sources have many missing observations; machine learning models usually cannot hanlde missingness. These points must be carefully imputed with approximations, which can be very challenging when the data has geographical dimension.&lt;/li&gt;
&lt;li&gt;Removing missing values makes samples extremely biased and your model will learn from omissions, not information.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;photo-credits&#34;&gt;Photo Credits&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;What&amp;rsquo;s the Problem with Open Data?&lt;/em&gt; illustration is a photo by &lt;a href=&#34;https://unsplash.com/photos/8hJQKRIQZMY&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Cristina Gottardi&lt;/a&gt;
&lt;em&gt;How We Add Value?&lt;/em&gt; illustration is a photo by &lt;a href=&#34;https://unsplash.com/photos/IEiAmhXehwE&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Nana Smirnova&lt;/a&gt;.
&lt;em&gt;Is There Value Left in It?&lt;/em&gt; is a photo by &lt;a href=&#34;https://unsplash.com/photos/GcnPjvqRL18&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Imelda&lt;/a&gt;
&lt;em&gt;Datasets Should Work Together to Give Information&lt;/em&gt; is a photo by &lt;a href=&#34;https://unsplash.com/photos/huRn8ECqADI&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Lucas Santos&lt;/a&gt;&lt;/p&gt;
&lt;h2 id=&#34;footnote-references&#34;&gt;Footnote References&lt;/h2&gt;
&lt;p&gt;[1] Pomerantz, Jeffrey. 2021. “Metadata.” MIT Press essential knowledge
series. MIT Press. Cambridge, Massachusetts ; London, England : The MIT
Press, [2015]&lt;/p&gt;
&lt;p&gt;[2] Pomerantz, Jeffrey. 2021. “Metadata.” MIT Press essential knowledge
series. MIT Press. Cambridge, Massachusetts ; London, England : The MIT
Press, [2015]&lt;/p&gt;
</description>
    </item>
    
  </channel>
</rss>
