<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Administrative Data on Kailas Venkitasubramanian</title>
    <link>/tags/administrative-data/</link>
    <description>Recent content in Administrative Data on Kailas Venkitasubramanian</description>
    <generator>Hugo</generator>
    <language>en</language>
    <lastBuildDate>Sun, 15 Sep 2024 00:00:00 +0000</lastBuildDate>
    <atom:link href="/tags/administrative-data/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Bayesian Improved Surname Geocoding: How It Works and Where We Use It</title>
      <link>/blog/posts/2024-09-15-bayesian-improved-surname-geocoding/</link>
      <pubDate>Sun, 15 Sep 2024 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2024-09-15-bayesian-improved-surname-geocoding/</guid>
      <description>&lt;p&gt;If you work with administrative data long enough, you run into the same wall eventually: the dataset has everything you need except race and ethnicity. Hospital discharge records, voter files, tax records, benefits enrollment data — these are often rich with information about where people live, what services they use, and what outcomes they experience. But ask whether they capture race or ethnicity, and the answer is usually no, or inconsistently, or only in ways that aren&amp;rsquo;t usable.&lt;/p&gt;</description>
    </item>
    <item>
      <title>CRDT - Data Privacy </title>
      <link>/talk/2024-04-10-crdt-data-privacy/</link>
      <pubDate>Wed, 10 Apr 2024 00:00:00 +0000</pubDate>
      <guid>/talk/2024-04-10-crdt-data-privacy/</guid>
      <description>On data privacy practice at the Charlotte Regional Data Trust</description>
    </item>
    <item>
      <title>Designing Reproducible Data Pipelines for Community Research</title>
      <link>/blog/series/reproducible-research-series/2022-04-10-designing-reproducible-data-pipelines/</link>
      <pubDate>Sat, 09 Mar 2024 00:00:00 +0000</pubDate>
      <guid>/blog/series/reproducible-research-series/2022-04-10-designing-reproducible-data-pipelines/</guid>
      <description>&lt;p&gt;In the first post of this series, I argued that reproducibility is not a technical luxury for community research institutions—it is an ethical and operational obligation. In this post, I want to move from philosophy to plumbing—because this is where reproducibility becomes real.&lt;/p&gt;&#xA;&lt;p&gt;Specifically: what does it mean to design &lt;em&gt;reproducible data pipelines&lt;/em&gt; in a community research environment?&lt;/p&gt;&#xA;&lt;p&gt;At the UNC Charlotte Urban Institute, this question became concrete as we built the &lt;strong&gt;Quality of Life Explorer&lt;/strong&gt;, developed deposit and extraction pipelines for the &lt;strong&gt;Charlotte Regional Data Trust&lt;/strong&gt;, and began orchestrating workflows using Apache Airflow in an AWS environment.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Revamping the Charlotte Regional Data Trust - The story so far</title>
      <link>/blog/posts/2023-05-22-revamping-the-charlotte-regional-data-trust-the-story-so-far/</link>
      <pubDate>Mon, 22 May 2023 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2023-05-22-revamping-the-charlotte-regional-data-trust-the-story-so-far/</guid>
      <description></description>
    </item>
    <item>
      <title>CRDT Anonymization and Privacy Project</title>
      <link>/project/crdt-anonymization-and-privacy-project/</link>
      <pubDate>Mon, 06 Jun 2022 00:00:00 +0000</pubDate>
      <guid>/project/crdt-anonymization-and-privacy-project/</guid>
      <description>&lt;div id=&#34;&#34; class=&#34;panelset&#34;&gt;&#xA;  &#xD;&#xA;&lt;div class=&#34;panel&#34;&gt;&#xA;  &lt;div class=&#34;panel-name&#34;&gt;Summary&lt;/div&gt;&#xA;  &#xA;  &lt;p&gt;&#xA;&#xA;&#xA;&#xA;&lt;h5 id=&#34;objectives-and-scope&#34;&gt;Objective(s) and Scope&#xA;  &lt;a href=&#34;#objectives-and-scope&#34;&gt;&lt;/a&gt;&#xA;&lt;/h5&gt;&#xA;&lt;p&gt;The project involves the development and implementation of protocols and best practices in privacy for data dissemination, including statistical disclosure control procedures of the integrated data system hosted by CRDT. This includes creating guidelines for data collection, storage, and dissemination, as well as implementing robust technical measures to prevent unauthorized access and disclosure of sensitive information.&lt;/p&gt;&#xA;&lt;p&gt;The project will start with a comprehensive review of current privacy practices and identification of areas that need improvement. Based on this review, a set of protocols and best practices will be developed and incorporated into the organization&amp;rsquo;s data management processes. This will include the implementation of statistical disclosure control procedures to protect sensitive information. Training sessions will be conducted to educate employees on the new privacy protocols and best practices.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Institute for Social Capital - Data Privacy &amp; Security - Present and Future</title>
      <link>/talk/data-privacy-security/</link>
      <pubDate>Mon, 06 Dec 2021 00:00:00 +0000</pubDate>
      <guid>/talk/data-privacy-security/</guid>
      <description>On data privacy practice at the Institute of Social Capital</description>
    </item>
  </channel>
</rss>
