<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Community Research on Kailas Venkitasubramanian</title>
    <link>/tags/community-research/</link>
    <description>Recent content in Community Research on Kailas Venkitasubramanian</description>
    <generator>Hugo</generator>
    <language>en</language>
    <lastBuildDate>Tue, 11 Nov 2025 00:00:00 +0000</lastBuildDate>
    <atom:link href="/tags/community-research/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Some Thoughts on AI-Augmented Community Research at the Charlotte Urban Institute</title>
      <link>/blog/posts/2026-02-11-ai-augmented-community-research/</link>
      <pubDate>Tue, 11 Nov 2025 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2026-02-11-ai-augmented-community-research/</guid>
      <description>&lt;p&gt;I wanted to compile a few thoughts on using AI at the institute, or more broadly in organizations like ours. I&amp;rsquo;ve been using AI tools in my research for over a year, and I still can&amp;rsquo;t decide whether I&amp;rsquo;m more excited or more unsettled, though it feels more like the former these days. Still, this ambivalence feels like the right starting point for a conversation about where we, as an institute built on community trust and research, actually want to go with this. When I say AI, I mean the generative AI kind.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Embracing multilingualism in data science</title>
      <link>/blog/series/reproducible-research-series/2025-04-10-multilingualism-in-data-science/</link>
      <pubDate>Thu, 10 Apr 2025 00:00:00 +0000</pubDate>
      <guid>/blog/series/reproducible-research-series/2025-04-10-multilingualism-in-data-science/</guid>
      <description>&lt;p&gt;Both of those efforts — reproducibility and pipelines — rest on a more basic question: which programming languages should a small research team actually use? In the previous posts of this series, I covered &#xA;&lt;a href=&#34;/blog/series/reproducible-research-series/2022-07-08-building-blocks-of-a-reproducible-research-framework/&#34;&gt;why reproducibility matters&lt;/a&gt; and how we are &#xA;&lt;a href=&#34;/blog/series/reproducible-research-series/2022-04-10-designing-reproducible-data-pipelines/&#34;&gt;designing reproducible data pipelines&lt;/a&gt; at the UNC Charlotte Urban Institute. This post is about the layer underneath both.&lt;/p&gt;&#xA;&lt;p&gt;Specifically, I want to argue that embracing &lt;em&gt;multilingualism&lt;/em&gt;&amp;mdash;fluency in both R and Python, rather than loyalty to one&amp;mdash;has quietly done more for our team&amp;rsquo;s output than almost any other choice we&amp;rsquo;ve made.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Switching from ArcGIS to QGIS (and a bit of R too)</title>
      <link>/blog/posts/2025-02-18-switching-from-arcgis-to-qgis/</link>
      <pubDate>Tue, 18 Feb 2025 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2025-02-18-switching-from-arcgis-to-qgis/</guid>
      <description>&lt;p&gt;I have been using ArcGIS for longer than I care to admit. I started with it during my postgraduate years, and for a long time it was just &lt;em&gt;the&lt;/em&gt; GIS software, the one everyone used, the one you learned if you wanted to do spatial analysis seriously. Our university has an enterprise license, so it has always been available, and old habits die hard.&lt;/p&gt;&#xA;&lt;p&gt;But lately, I find myself opening it less and less. And when I do, there is usually a nagging feeling that I could be doing this in QGIS instead.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Bayesian Improved Surname Geocoding: How It Works and Where We Use It</title>
      <link>/blog/posts/2024-09-15-bayesian-improved-surname-geocoding/</link>
      <pubDate>Sun, 15 Sep 2024 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2024-09-15-bayesian-improved-surname-geocoding/</guid>
      <description>&lt;p&gt;If you work with administrative data long enough, you run into the same wall eventually: the dataset has everything you need except race and ethnicity. Hospital discharge records, voter files, tax records, benefits enrollment data — these are often rich with information about where people live, what services they use, and what outcomes they experience. But ask whether they capture race or ethnicity, and the answer is usually no, or inconsistently, or only in ways that aren&amp;rsquo;t usable.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Using tidycensus to Analyze ACS PUMS Data</title>
      <link>/blog/posts/2024-05-12-analyzing-census-pums-data-with-tidycensus/</link>
      <pubDate>Sun, 12 May 2024 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2024-05-12-analyzing-census-pums-data-with-tidycensus/</guid>
      <description>&lt;p&gt;If you&amp;rsquo;ve spent any time working with Census data, you know the drill: pull a pre-aggregated table, get median household income by county, move on. It works, and for a lot of questions, it&amp;rsquo;s exactly what you need. But sometimes the published tables just don&amp;rsquo;t cut it. What if you want to look at wage distributions for workers with specific educational credentials? Or model individual-level outcomes rather than tract-level averages? That&amp;rsquo;s where PUMS comes in — and once you start using it, it&amp;rsquo;s hard to go back.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Designing Reproducible Data Pipelines for Community Research</title>
      <link>/blog/series/reproducible-research-series/2022-04-10-designing-reproducible-data-pipelines/</link>
      <pubDate>Sat, 09 Mar 2024 00:00:00 +0000</pubDate>
      <guid>/blog/series/reproducible-research-series/2022-04-10-designing-reproducible-data-pipelines/</guid>
      <description>&lt;p&gt;In the first post of this series, I argued that reproducibility is not a technical luxury for community research institutions—it is an ethical and operational obligation. In this post, I want to move from philosophy to plumbing—because this is where reproducibility becomes real.&lt;/p&gt;&#xA;&lt;p&gt;Specifically: what does it mean to design &lt;em&gt;reproducible data pipelines&lt;/em&gt; in a community research environment?&lt;/p&gt;&#xA;&lt;p&gt;At the UNC Charlotte Urban Institute, this question became concrete as we built the &lt;strong&gt;Quality of Life Explorer&lt;/strong&gt;, developed deposit and extraction pipelines for the &lt;strong&gt;Charlotte Regional Data Trust&lt;/strong&gt;, and began orchestrating workflows using Apache Airflow in an AWS environment.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Changing CRDT operations under a Cloud</title>
      <link>/blog/series/crdt-telenovela-series/2023-06-09-making-sense-of-data-and-documenting-it/</link>
      <pubDate>Fri, 09 Jun 2023 00:00:00 +0000</pubDate>
      <guid>/blog/series/crdt-telenovela-series/2023-06-09-making-sense-of-data-and-documenting-it/</guid>
      <description>&lt;h2 id=&#34;the-promise-and-peril-of-a-large-contract&#34;&gt;The promise and peril of a large contract&#xA;  &lt;a href=&#34;#the-promise-and-peril-of-a-large-contract&#34;&gt;&lt;svg class=&#34;anchor-symbol&#34; aria-hidden=&#34;true&#34; height=&#34;26&#34; width=&#34;26&#34; viewBox=&#34;0 0 22 22&#34; xmlns=&#34;http://www.w3.org/2000/svg&#34;&gt;&#xA;      &lt;path d=&#34;M0 0h24v24H0z&#34; fill=&#34;currentColor&#34;&gt;&lt;/path&gt;&#xA;      &lt;path d=&#34;M3.9 12c0-1.71 1.39-3.1 3.1-3.1h4V7H7c-2.76.0-5 2.24-5 5s2.24 5 5 5h4v-1.9H7c-1.71.0-3.1-1.39-3.1-3.1zM8 13h8v-2H8v2zm9-6h-4v1.9h4c1.71.0 3.1 1.39 3.1 3.1s-1.39 3.1-3.1 3.1h-4V17h4c2.76.0 5-2.24 5-5s-2.24-5-5-5z&#34;&gt;&lt;/path&gt;&#xA;    &lt;/svg&gt;&lt;/a&gt;&#xA;&lt;/h2&gt;&#xA;&lt;p&gt;Many of the earlier challenges in managing technical operations at the data trust stemmed from not understanding the scope and effort a given piece of work required, and from having no barometer to measure productivity (or the lack of it). This meant that everyone knew a given piece of work took one month to complete, and everyone agreed that this delay was not acceptable, but no one could pinpoint where the bottlenecks were or why they existed.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Plunging into the Data Trust black box, and Deep Cleaning the System</title>
      <link>/blog/series/crdt-telenovela-series/2023-05-20-plunging-into-data-trust/</link>
      <pubDate>Sat, 20 May 2023 00:00:00 +0000</pubDate>
      <guid>/blog/series/crdt-telenovela-series/2023-05-20-plunging-into-data-trust/</guid>
      <description>&lt;h2 id=&#34;diving-into-the-world-of-administrative-data-and-crdt&#34;&gt;Diving into the world of administrative data and CRDT&#xA;  &lt;a href=&#34;#diving-into-the-world-of-administrative-data-and-crdt&#34;&gt;&lt;svg class=&#34;anchor-symbol&#34; aria-hidden=&#34;true&#34; height=&#34;26&#34; width=&#34;26&#34; viewBox=&#34;0 0 22 22&#34; xmlns=&#34;http://www.w3.org/2000/svg&#34;&gt;&#xA;      &lt;path d=&#34;M0 0h24v24H0z&#34; fill=&#34;currentColor&#34;&gt;&lt;/path&gt;&#xA;      &lt;path d=&#34;M3.9 12c0-1.71 1.39-3.1 3.1-3.1h4V7H7c-2.76.0-5 2.24-5 5s2.24 5 5 5h4v-1.9H7c-1.71.0-3.1-1.39-3.1-3.1zM8 13h8v-2H8v2zm9-6h-4v1.9h4c1.71.0 3.1 1.39 3.1 3.1s-1.39 3.1-3.1 3.1h-4V17h4c2.76.0 5-2.24 5-5s-2.24-5-5-5z&#34;&gt;&lt;/path&gt;&#xA;    &lt;/svg&gt;&lt;/a&gt;&#xA;&lt;/h2&gt;&#xA;&lt;p&gt;&amp;ldquo;Administrative data is messy&amp;rdquo; is less an adage than a reality. When I took the reins of the data infrastructure and analytical operations of the Institute for Social Capital, or ISC (now called the Charlotte Regional Data Trust), in the middle of 2021, the messiness extended beyond the data. The dysfunction ran deep in how data was collected and organized, the way data operations and analyses were conducted, how information was collected from stakeholders, and how data was disseminated.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Towards reproducible data science for community and policy research - An experiential roadmap</title>
      <link>/talk/towards-reproducible-data-science/</link>
      <pubDate>Mon, 06 Mar 2023 00:00:00 +0000</pubDate>
      <guid>/talk/towards-reproducible-data-science/</guid>
      <description>On developing a reproducible data science framework and practice at the Charlotte Urban Institute</description>
    </item>
    <item>
      <title>UI Reproducibility Project</title>
      <link>/project/ui-reproducibility-project/</link>
      <pubDate>Sat, 31 Dec 2022 00:00:00 +0000</pubDate>
      <guid>/project/ui-reproducibility-project/</guid>
      <description>&lt;div id=&#34;&#34; class=&#34;panelset&#34;&gt;&#xA;  &#xD;&#xA;&lt;div class=&#34;panel&#34;&gt;&#xA;  &lt;div class=&#34;panel-name&#34;&gt;Summary&lt;/div&gt;&#xA;  &#xA;  &lt;p&gt;&#xA;&#xA;&#xA;&#xA;&lt;h5 id=&#34;background&#34;&gt;Background&#xA;  &lt;a href=&#34;#background&#34;&gt;&lt;/a&gt;&#xA;&lt;/h5&gt;&#xA;&lt;p&gt;Diverse research backgrounds, skills and operational practices make our institute versatile and nimble in addressing research problems that cross several domains. But they have also allowed analytical research practices to remain fragmented and inefficient.&lt;/p&gt;&#xA;&lt;p&gt;The Urban Institute data science team recognized the significance of reproducibility in analytical community research practice in two distinct contexts: 1) operational efficiency via streamlined use and reuse of data, analytical tools and assets; 2) developing a culture of transparency and trust that underpins reproducible research, whose products become fully replicable and auditable.&lt;/p&gt;</description>
    </item>
    <item>
      <title>UI Data and Analytics Guide</title>
      <link>/project/ui-data-analytics-guide/</link>
      <pubDate>Wed, 03 Aug 2022 00:00:00 +0000</pubDate>
      <guid>/project/ui-data-analytics-guide/</guid>
      <description>&lt;div id=&#34;&#34; class=&#34;panelset&#34;&gt;&#xA;  &#xD;&#xA;&lt;div class=&#34;panel&#34;&gt;&#xA;  &lt;div class=&#34;panel-name&#34;&gt;Summary&lt;/div&gt;&#xA;  &#xA;  &lt;p&gt;&#xA;&#xA;&#xA;&#xA;&lt;h5 id=&#34;objectives-and-scope&#34;&gt;Objective(s) and Scope&#xA;  &lt;a href=&#34;#objectives-and-scope&#34;&gt;&lt;/a&gt;&#xA;&lt;/h5&gt;&#xA;&lt;p&gt;The project aims to create a comprehensive guide to all operational processes of the Urban Institute, serving as a primary point of reference for all research staff in managing the institute&amp;rsquo;s data and analytical resources.&lt;/p&gt;&#xA;&lt;p&gt;The manual will be created using R Markdown, a tool that allows for the creation of rich, interactive documents. The manual will be hosted as a website that can be easily updated and maintained by team members.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Charlotte Regional Data Trust - Technical Operations Manual</title>
      <link>/talk/tech-operations-manual/</link>
      <pubDate>Sat, 06 Aug 2022 00:00:00 +0000</pubDate>
      <guid>/talk/tech-operations-manual/</guid>
      <description>On how we developed the technical operations manual at the Charlotte Regional Data Trust</description>
    </item>
    <item>
      <title>CRDT Data Documentation Project</title>
      <link>/project/crdt-data-documentation-project/</link>
      <pubDate>Thu, 03 Mar 2022 00:00:00 +0000</pubDate>
      <guid>/project/crdt-data-documentation-project/</guid>
      <description>&lt;div id=&#34;&#34; class=&#34;panelset&#34;&gt;&#xA;  &#xD;&#xA;&lt;div class=&#34;panel&#34;&gt;&#xA;  &lt;div class=&#34;panel-name&#34;&gt;Summary&lt;/div&gt;&#xA;  &#xA;  &lt;p&gt;&#xA;&#xA;&#xA;&#xA;&lt;h5 id=&#34;objectives-and-scope&#34;&gt;Objective(s) and Scope&#xA;  &lt;a href=&#34;#objectives-and-scope&#34;&gt;&lt;/a&gt;&#xA;&lt;/h5&gt;&#xA;&lt;p&gt;The project seeks to enhance the quality and completeness of CRDT&amp;rsquo;s data documentation, and establish a centralized and organized infrastructure for storing and managing metadata.&lt;/p&gt;&#xA;&lt;p&gt;The project includes reviewing the existing metadata, developing standardized data documentation (metadata, codebook, data dictionary), and implementing a data infrastructure for storing and organizing metadata for all databases.&lt;/p&gt;&#xA;&#xA;&#xA;&#xA;&#xA;&lt;h5 id=&#34;expected-outcomes&#34;&gt;Expected Outcomes&#xA;  &lt;a href=&#34;#expected-outcomes&#34;&gt;&lt;/a&gt;&#xA;&lt;/h5&gt;&#xA;&lt;ul&gt;&#xA;&lt;li&gt;Improved data documentation quality and completeness.&lt;/li&gt;&#xA;&lt;li&gt;Consistent and standardized data documentation across all databases.&lt;/li&gt;&#xA;&lt;li&gt;A centralized and organized infrastructure for storing and accessing metadata.&lt;/li&gt;&#xA;&lt;/ul&gt;&#xA;&lt;/p&gt;</description>
    </item>
    <item>
      <title>What&#39;s work like?</title>
      <link>/blog/posts/2021-04-18-what-s-work-like/</link>
      <pubDate>Mon, 12 Apr 2021 00:00:00 +0000</pubDate>
      <guid>/blog/posts/2021-04-18-what-s-work-like/</guid>
      <description>&lt;h2 id=&#34;dust-settles&#34;&gt;Dust Settles&#xA;  &lt;a href=&#34;#dust-settles&#34;&gt;&lt;svg class=&#34;anchor-symbol&#34; aria-hidden=&#34;true&#34; height=&#34;26&#34; width=&#34;26&#34; viewBox=&#34;0 0 22 22&#34; xmlns=&#34;http://www.w3.org/2000/svg&#34;&gt;&#xA;      &lt;path d=&#34;M0 0h24v24H0z&#34; fill=&#34;currentColor&#34;&gt;&lt;/path&gt;&#xA;      &lt;path d=&#34;M3.9 12c0-1.71 1.39-3.1 3.1-3.1h4V7H7c-2.76.0-5 2.24-5 5s2.24 5 5 5h4v-1.9H7c-1.71.0-3.1-1.39-3.1-3.1zM8 13h8v-2H8v2zm9-6h-4v1.9h4c1.71.0 3.1 1.39 3.1 3.1s-1.39 3.1-3.1 3.1h-4V17h4c2.76.0 5-2.24 5-5s-2.24-5-5-5z&#34;&gt;&lt;/path&gt;&#xA;    &lt;/svg&gt;&lt;/a&gt;&#xA;&lt;/h2&gt;</description>
    </item>
  </channel>
</rss>
