Data Management Archives - data.org http://data.org/topic/data-management/ Tue, 25 Jul 2023 20:56:55 +0000 en-US hourly 1 https://wordpress.org/?v=6.4.2 https://data.org/wp-content/uploads/2021/07/cropped-favicon-test-32x32.png Data Management Archives - data.org http://data.org/topic/data-management/ 32 32 Selecting the Right Data Analytics Tools for Your Nonprofit https://data.org/resources/selecting-the-right-data-analytics-tools-for-your-nonprofit/ Fri, 10 Mar 2023 18:26:14 +0000 https://data.org/?post_type=resource&p=16499 Overview People need technology to make sense of data and to present it in a digestible way. That’s where web and app analytics tools come in. The tools described in this resource gather and visualize data to help people make meaningful decisions.

The post Selecting the Right Data Analytics Tools for Your Nonprofit appeared first on data.org.

]]>
Overview

People need technology to make sense of data and to present it in a digestible way. That’s where web and app analytics tools come in. The tools described in this resource gather and visualize data to help people make meaningful decisions.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post Selecting the Right Data Analytics Tools for Your Nonprofit appeared first on data.org.

]]>
Participatory Data Stewardship https://data.org/resources/participatory-data-stewardship/ Mon, 06 Feb 2023 21:30:49 +0000 https://data.org/?post_type=resource&p=15722 Overview Well-managed data can support social impact organizations (SIOs) as they conduct lifesaving health research, reduce environmental threats, and produce societal value for individuals and communities. But these benefits are often overshadowed by harms, as current practices in data collection, storage, sharing, and use have led to high-profile misuses of…

The post Participatory Data Stewardship appeared first on data.org.

]]>
Overview

Well-managed data can support social impact organizations (SIOs) as they conduct lifesaving health research, reduce environmental threats, and produce societal value for individuals and communities. But these benefits are often overshadowed by harms, as current practices in data collection, storage, sharing, and use have led to high-profile misuses of personal data, data breaches, and sharing scandals. This report provides a practical framework and shares case studies to demonstrate how citizens can participate in shaping and improving ways data is collected and used. 

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post Participatory Data Stewardship appeared first on data.org.

]]>
11 Steps to Collect and Leverage High-Quality Survey Data https://data.org/resources/11-steps-to-collect-and-leverage-high-quality-survey-data/ Wed, 23 Nov 2022 19:45:46 +0000 https://data.org/?post_type=resource&p=14333 Overview Surveys are an important form of data collection to support program improvement and storytelling. We also know it can be challenging to create, disseminate, and analyze surveys amid all your other priorities. This resource shares eleven steps to collect and leverage high-quality survey data.

The post 11 Steps to Collect and Leverage High-Quality Survey Data appeared first on data.org.

]]>
Overview

Surveys are an important form of data collection to support program improvement and storytelling. We also know it can be challenging to create, disseminate, and analyze surveys amid all your other priorities. This resource shares eleven steps to collect and leverage high-quality survey data.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post 11 Steps to Collect and Leverage High-Quality Survey Data appeared first on data.org.

]]>
Collecting and Tracking Virtual Engagement Data https://data.org/resources/collecting-and-tracking-virtual-engagement-data/ Wed, 23 Nov 2022 19:45:44 +0000 https://data.org/?post_type=resource&p=14328 Overview Collecting data is essential to determine your organization’s impact and reach, but collecting data remotely can be extremely difficult during virtual programming sessions. With this in mind, it is important to have a plan of action to track and measure success. This resource shares key areas to remember when…

The post Collecting and Tracking Virtual Engagement Data appeared first on data.org.

]]>
Overview

Collecting data is essential to determine your organization’s impact and reach, but collecting data remotely can be extremely difficult during virtual programming sessions. With this in mind, it is important to have a plan of action to track and measure success. This resource shares key areas to remember when planning to collect data on virtual programs.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post Collecting and Tracking Virtual Engagement Data appeared first on data.org.

]]>
3 Key Challenges to Data Cleaning in Digital Development Programs https://data.org/resources/3-key-challenges-to-data-cleaning-in-digital-development-programs/ Wed, 28 Sep 2022 13:18:44 +0000 https://data.org/?post_type=resource&p=13952 Overview In the last decade, incredible improvements have been made on the data collection side that helps the processing pipeline. However, various datasets are managed and maintained by several organizations in disconnected systems. This causes some information about the data to be lost during this transition, and people doing the cleaning have…

The post 3 Key Challenges to Data Cleaning in Digital Development Programs appeared first on data.org.

]]>
Overview

In the last decade, incredible improvements have been made on the data collection side that helps the processing pipeline. However, various datasets are managed and maintained by several organizations in disconnected systems. This causes some information about the data to be lost during this transition, and people doing the cleaning have no control over the collection. The solutions to data cleaning challenges that arise demand varying degrees of personnel hours, which in some cases could be avoided through automation.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post 3 Key Challenges to Data Cleaning in Digital Development Programs appeared first on data.org.

]]>
3 Key Steps to a Successful Data Commons https://data.org/guides/3-key-steps-to-a-successful-data-commons/ Fri, 27 May 2022 11:00:00 +0000 https://data.org/?post_type=guide&p=11410 Introduction Data commons collocate data with cloud computing infrastructure and software services, applications, and tools to create powerful resources for the large-scale management, analysis, harmonization, and sharing of data.  Unlike data warehouses, data lakes, and other systems that support an organization’s business analytics, a data commons is focused on providing…

The post 3 Key Steps to a Successful Data Commons appeared first on data.org.

]]>
Introduction

Data commons collocate data with cloud computing infrastructure and software services, applications, and tools to create powerful resources for the large-scale management, analysis, harmonization, and sharing of data.  Unlike data warehouses, data lakes, and other systems that support an organization’s business analytics, a data commons is focused on providing a resource for a community or collaboration, or in support of multiple communities. 

The term data commons derives from the more general concept of a “commons,” a term that was popularized by Elinor Ostrom, who defined it as a natural, cultural, or digital resource accessible to all members of a community, or, more broadly, of a society. An example is a pasture for animals to graze in a village. These resources are held in common, through a partnership, a not-for-profit, or another entity, but not owned privately for commercial gain [1].

An example of a data commons is the UK Biobank; a large-scale biomedical database with 500,000 participants who consent for their data to be used for the specific purpose of bonafide research on the diagnosis, prevention, and/or treatment of serious and life-threatening illnesses. Evidently, data commons can be powerful catalysts to innovation and transformation of the social impact sector especially when collaborative and synergistic action is required in the efforts to strengthen epidemic preparedness and response, financial inclusion, and enabling access to opportunity.

Data commons can be very challenging to set up and more so to maintain successfully. In this guide, we will share more examples of working data commons and three essential steps to maximize chances of success.

In this Guide
  • Get inspired: explore examples of existing data commons
  • Define your why and how: Understand the importance of governance in data commons
  • Bootstrap: Investigate technology platforms for data commons
  • Plan for the long term: Consider issues of data ingest and harmonization

Data Commons in Practice

Data Commons have been developed to address a diverse set of data sharing and research needs.  The following are some examples of data commons:

Step 1: Governance and Agreements

Establishing effective governance for a data commons is a critical factor for success. Specific considerations will depend on the nature of the data hosted by the commons and the community it supports and may include: agreements for contributing data, permissible use of data, intellectual property rights, publishing and citation guidelines, and operational principles. It’s advisable to look for specialized guidelines related to the sector where you will be getting your data (for example Health Information Exchange is a relevant standard if your platform focuses on health). It is important to do your research and identify the standards that apply to your data scope. Below are some general resources that can help you in this step:

Step 2: Choosing a Platform

Underlying a data commons is a software infrastructure that manages access to data storage, imposes structure on data, and may offer analysis and/or visualization tools. A data commons platform may support some level of interoperability with other data commons, allowing for possible participation in a broader data ecosystem or data mesh. Examples of software platforms for building data commons include:

Gen3 – a general-purpose open-source platform supporting ad hoc analysis 

Terra – a platform for biomedical data

Figshare – a general data platform for storing, sharing, and discovering research data

Dataverse – open-source research data repository software

Step 3: Getting the data

Successful commons curate and harmonize the data and produce data products of broad interest to the community. It’s time-consuming, expensive, and labor-intensive to curate and harmonize data; much of the value of data commons is centralizing this effort so that it can be done once instead of many times by each group that needs the data.  Here are some useful resources that described principles, standards, and best practices for working with research data in data commons:

The FAIR data principles

The CARE Principles for Indigenous Data Governance

Research Data Alliance

Conclusion

Data commons support a community’s management, analysis, and sharing of data.  For this reason, data commons require a governance framework that supports the community’s values and goals.  Furthermore, these frameworks are key to ensuring resources are used for the specific purposes for which they were made available. This, in turn, enables more resources to be safely mobilized for the benefit of the community. Successful data commons tend to carefully curate and harmonize the data they contain, which reduces the time and effort required for users to analyze the data, especially when the data comes from multiple sources. This is especially helpful for emergency research and innovation. 

Please feel free to suggest any other guides you found helpful by contacting us and we may incorporate them.

[1] Elinor Ostrom. Governing the commons: The evolution of institutions for collective action. Cambridge university press, 1990.

Grateful for the contribution of Robert Grossman, Ph.D., Frederick H. Rawson Distinguished Service Professor of Medicine and Computer Science, and the Jim and Karen Frank Director of the Center for Translational Data Science (CTDS) at the University of Chicago.

Join Our Community

Make connections with other social impact organizations and receive a curated listing of community and data.org events and opportunities.

By submitting your information and clicking “Submit”, you agree to the data.org Privacy Policy and Terms and Conditions, and to receive email communications from data.org.

The post 3 Key Steps to a Successful Data Commons appeared first on data.org.

]]>
Humanitarian Data Ethics https://data.org/resources/humanitarian-data-ethics/ Mon, 17 Jan 2022 17:57:01 +0000 https://data.org/?post_type=resource&p=7357 Overview This resource focuses specifically on the ethical aspects of humanitarian data management, ranging from standard exercises such as field-level data collection and processing to more advanced applications of data science, such as predictive analytics. It addresses common ethical concerns with data management, including validity, bias and fairness, and privacy…

The post Humanitarian Data Ethics appeared first on data.org.

]]>
Overview

This resource focuses specifically on the ethical aspects of humanitarian data management, ranging from standard exercises such as field-level data collection and processing to more advanced applications of data science, such as predictive analytics. It addresses common ethical concerns with data management, including validity, bias and fairness, and privacy and anonymity.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post Humanitarian Data Ethics appeared first on data.org.

]]>
Closing a Project: Archiving and Preservation of Content https://data.org/resources/closing-a-project-archiving-and-preservation-of-content/ Mon, 10 Jan 2022 21:38:33 +0000 https://data.org/?post_type=resource&p=5747 Overview Archiving is a general term for the range of practices and decisions that support the long-term preservation, use, and accessibility of content with enduring value. It is not a one-time action but a process and an investment that connects directly to an organization’s goals for project outcomes and impact.…

The post Closing a Project: Archiving and Preservation of Content appeared first on data.org.

]]>
Overview

Archiving is a general term for the range of practices and decisions that support the long-term preservation, use, and accessibility of content with enduring value. It is not a one-time action but a process and an investment that connects directly to an organization’s goals for project outcomes and impact. This resource provides a guide on archiving and preserving data when the project is entering its closing stages.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post Closing a Project: Archiving and Preservation of Content appeared first on data.org.

]]>
Data Impact Assessments https://data.org/resources/data-impact-assessments/ Mon, 10 Jan 2022 21:35:05 +0000 https://data.org/?post_type=resource&p=5759 Overview This resource provides an introduction to Data Impact Assessments (DIAs) in humanitarian action. DIAs can help maximize benefits and minimize risks in operational data management. DIAs can also serve as a compliance mechanism with applicable laws, regulations, and internal policies and provide other guidance to reduce risks in data…

The post Data Impact Assessments appeared first on data.org.

]]>
Overview

This resource provides an introduction to Data Impact Assessments (DIAs) in humanitarian action. DIAs can help maximize benefits and minimize risks in operational data management. DIAs can also serve as a compliance mechanism with applicable laws, regulations, and internal policies and provide other guidance to reduce risks in data management.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post Data Impact Assessments appeared first on data.org.

]]>
How to Build Your Data Stack https://data.org/guides/data-stack/ Mon, 10 Jan 2022 21:00:14 +0000 https://data.org/?post_type=guide&p=6372 Introduction Back in the early 90s, companies and organizations started investing in what was referred to as a webmaster — essentially the person that was in charge of all things having to do with the internet. As website needs became more complex and core to operations, we realized the internet…

The post How to Build Your Data Stack appeared first on data.org.

]]>
Introduction

Back in the early 90s, companies and organizations started investing in what was referred to as a webmaster — essentially the person that was in charge of all things having to do with the internet. As website needs became more complex and core to operations, we realized the internet wasn’t something that could be handled by one webmaster. Something similar has happened with data, where organizations invest in a “Data Person” to oversee all things data. But just like with the internet, collecting, storing, and analyzing data requires lots of different skills and tools that can’t be found in a single person.

The Data Stack is essentially what moves your data from individual ingredients stored in individual systems, to one cohesive data environment that is accessible and usable. It is often referred to as Data Engineering. This guide will walk you through the four major parts of a data stack and point to some of the things you need to know and consider as you build out your data infrastructure.

In this Guide
  • Ways to collect and share data in your organization.
  • Recommended data storage technologies and solutions.
  • Practices for keeping your data secure.
  • Ways to strategically use data in your organization.

Data Collection and Sharing

The first part of your data stack is all about how you get data. Some organizations have extensive data collection tools and practices and that is how they primarily think about getting data. Data sharing can be a great way to get access to data as well. Oftentimes someone has the data that you are looking for and you are able to negotiate agreements to get access to that data (and share your own).

Some resources on Data Collection and Data Sharing:

Data Storage and Transformation

Once you have all of your data ingredients, it is time to get them organized and ready for consumption. This step could be considered a  kind of data mise en place for those data-cooking nerds. This often is about moving data from a source system (such as your data collection tools or CRM) into a data warehouse where it can be accessed and analyzed more broadly and in conjunction with data from multiple sources. There are many vendors and tools that specialize in either ETL (Extract – Transform – Load) or ELT (Extract – Load – Transform) technologies that help move all of your data into one place, and several options for cloud providers who can store all of that data in one place.

Data Security

Don’t collect what you can’t protect is a useful motto to live by. Ensuring good data security tools and practices is an integral part of your data stack as an organization. Many social sector organizations are collecting sensitive data about vulnerable populations; we must do everything we can to ensure that we are protecting that information. This isn’t just about technology solutions, but also about training and equipping your staff to protect the data your organization collects. We often point to flaws in technology when data security issues arise, but it’s equally likely for human behavior to be the cause when staff is insufficiently trained, or targeted by social engineering.

Data Use

Once you have all of your data in one place and secure, you can start using analytics and business intelligence (BI) tools to turn your data into something valuable for your organization. There are many different options to consider when looking for BI & analytics solutions. A helpful frame is that the more technical skills people in your organization have, the more general the tools they can use. For example, if you have well-trained computer and data scientists on your team they’ll likely default to using tools like Python and R to interact with and manipulate large amounts of data. If your talent is more focused at the analyst level you will likely want to use tools like Tableau that are more visual in nature and more straightforward to use.

Conclusion

These resources will help you level up your organization’s data stack. This represents a maturation from a tool-based approach (i.e., I have and analyze my data in a set of tools) to take a more infrastructure-focused approach where you use tools more specifically to collect/ingest data, bring it together, and then analyze it from one place. As you invest more in this kind of capability, you are also increasing the value that you are able to get from your data. We also suggest reaching out to peer organizations, whose challenges and learning may provide direct and current examples of ways forward. Please feel free to suggest any other guides you found helpful by contacting us and we may incorporate them.

Join Our Community

Make connections with other social impact organizations and receive a curated listing of community and data.org events and opportunities.

By submitting your information and clicking “Submit”, you agree to the data.org Privacy Policy and Terms and Conditions, and to receive email communications from data.org.

Related Content

The post How to Build Your Data Stack appeared first on data.org.

]]>
Using your Data Responsibly and Ethically https://data.org/guides/using-your-data-responsibly-and-ethically/ Mon, 10 Jan 2022 18:11:33 +0000 https://data.org/?post_type=guide&p=5659 Introduction It seems like every week there are new data breaches, concerns about our privacy, and stories about data science gone wrong. We hear about banks giving credit to customers they know can’t afford it, casinos using data science and behavioral economics to keep customers coming back for more and…

The post Using your Data Responsibly and Ethically appeared first on data.org.

]]>
Introduction

It seems like every week there are new data breaches, concerns about our privacy, and stories about data science gone wrong. We hear about banks giving credit to customers they know can’t afford it, casinos using data science and behavioral economics to keep customers coming back for more and driving them into bankruptcy, and on and on…

These kinds of behaviors aren’t limited to private sector actors in search of greater profits. Nonprofits can also be irresponsible with their data, or fall victim to unintended consequences. Blackbaud suffered a data breach and ended up paying ransom to hackers that were in their system for months. The United Nations shared data about Rohingya refugees without their consent, possibly increasing their vulnerability in returning to Myanmar.

In this Guide
  • Simple, practical tools and principles to help your organization use data responsibly. 
  • Checklist created by the DrivenData team for data scientists to assess the risk of a project at the outset.
  • A tool developed by IDEO to support ethical design with data.
  • How Salesforce developed the first office of Ethical & Humane Use.
  • Help an organization develop a mindset for collecting and using data responsibly.

What does responsible data mean? What is irresponsible data?

What do we mean by responsible use? The Internet Society defines it as “applying ethical principles of transparency, fairness, and respect to how we treat the data that affects people’s lives”. The Responsible Data project defines it as “the collective duty to account for unintended consequences of working with data by prioritizing people’s rights to consent, privacy, security, and ownership when using data and implementing values and practices of transparency and openness”.

Think of responsible use as ensuring that our data practices align to our values. For example, many nonprofit organizations work with what would be described as vulnerable people. Our data efforts should never increase that vulnerability. We shouldn’t collect information that we are unable to protect. Here are some helpful resources on responsible data.

Data Science Projects

While the potential ethical issues that might arise when using data science and artificial intelligence have certainly been in the popular press recently, there has not been as much discussion with respect to how a data science team should incorporate ethics within a project. To help ensure ethics is considered during a data science project, below are resources to identify and address data science ethical conundrums.

Design Principles

Design firm IDEO has developed 4 Ethical Data Design Principles to help those developing data tools. The principles are useful guardrails and reminders as you’re using data to develop products and analysis. They’ve also created a series of Blind Spot Checks that can be used during group brainstorming design sessions to ensure your design process is responsible.

Ethical and Humane Use Leadership

Salesforce developed an Office of Ethical & Humane Use to develop principles and practices for ensuring that their technology is being used in ways that align with the company’s values. It has enabled employees to report unethical uses by customers, developed a toolkit to help teams build with intention, and pushed other technology companies to develop similar offices. 

Conclusion

The resources above will help you implement responsible data use for your organizations. We also suggest reaching out to peer organizations, whose challenges and learning may provide direct and current examples of ways forward. Please feel free to suggest any other guides you found helpful by contacting us and we may incorporate them.

Join Our Community

Make connections with other social impact organizations and receive a curated listing of community and data.org events and opportunities.

By submitting your information and clicking “Submit”, you agree to the data.org Privacy Policy and Terms and Conditions, and to receive email communications from data.org.

Related Content

Related Guides

The post Using your Data Responsibly and Ethically appeared first on data.org.

]]>
The DataKind Playbook https://data.org/resources/the-datakind-playbook/ Mon, 10 Jan 2022 17:22:47 +0000 https://data.org/?post_type=resource&p=5885 Overview The DataKind Playbook is a globally-accessible, living knowledge base that enables users to design, implement, and manage a Data Science and AI for Good project following DataKind’s proven approach. Created by DataKind’s staff and volunteers, the Playbook provides direction on how to approach a Data Science and AI for Good…

The post The DataKind Playbook appeared first on data.org.

]]>
Overview

The DataKind Playbook is a globally-accessible, living knowledge base that enables users to design, implement, and manage a Data Science and AI for Good project following DataKind’s proven approach. Created by DataKind’s staff and volunteers, the Playbook provides direction on how to approach a Data Science and AI for Good project and to do it well. It also codifies the lessons learned from failures and successes for use on future projects.

Do you have feedback on this resource?

Thank you for your feedback as we strive to curate and publish resources to help social impact organizations succeed with data.

Send us a note

Explore More

Related Guides & Resources

The post The DataKind Playbook appeared first on data.org.

]]>