Learning Objective IOC-1.E:

Explain how people participate in problem-solving processes at scale. </font>

Essential Knowledge:

  • IOC-1.E.1: Widespread access to information and public data facilitates the identification of problems, development of solutions, and dissemination of results.
  • IOC-1.E.2: Science has been affected by using distributed and “citizen science” to solve scientific problems.
  • IOC-1.E.3: Citizen science is scientific research conducted in whole or part by distributed individuals, many of whom may not be scientists, who contribute relevant data to research using their own computing devices.
  • IOC-1.E.4: Crowdsourcing is the practice of obtaining input or information from a large number of people via the Internet.
  • IOC-1.E.5: Human capabilites can be enhanced by collaboration via computing.
  • IOC-1.E.6: Crowdsourcing offers new models for collaboration, such as connecting businesses or social causes with funding.

</font>

What is crowdsourcing?

  • Crowdsourcing, as an English word, can be defined as the practice of obtaining information or input into a task or project by enlisting the services of a large number of people, either paid or unpaid.

    • This is typically done via the internet.
    • A few examples of projects that utilize crowdsourcing are Wikipedia and Fiverr, as they rely on input from internet users in order to display information or advertise independent services. Furthermore, companies like companies like LEGO, PepsiCo, Starbucks and Fisher-Price all utilize crowdsourcing to recieve input from their customers and create better products/enhance consumer experience.

Crowdsourcing

Popcorn Hack 1

Q: Name another example of a project or company that could use crowdsourcing, and why. The example can be hypothetical or an already existing project/company.

Enter your answer here: Phone apps - many phone apps usually ask you to rate their app from 1 to 5 stars so that based off of that feedback they can make changes to their app.

Public Data Sets

  • Public data sets are defined as public data organized around a theme or topic that are generally accessible to the public (usually in an online format).
    • This relates to IOC-1.E.1: Widespread access to information and public data sets faciliates the identification of problems, development of solutions, and dissemination of results.

Examples of public data sets:

  • Kaggle:
    • Courses in visualization, TensorFlow, AI, machine learning, and more!
    • Competitions - cash prizes, swag, props
    • Notebooks - code, data, notes
  • Google Public Datasets:
    • Free for the 1st terabyte
    • All-in-one package that uses BigQuery - Library and toolset that for SQL
  • Data.gov:
    • US Government data
    • Agriculture, energy, water, health
  • Your local government:
    • Keeps them accountable
    • Help find solutions and trends in your area

Distributed Computing

  • Literally! According to Amazon Web Services (AWS), distributed computing can be defined as the method of making multiple computers work together to solve a common problem. A few examples can be seen below:
    • Folding@Home - Stanford University:
      • Protein folding: Proteins are long chains of amino acids, and they tend to fold a certain way. But sometimes, they can fold the wrong way, leading to problems like cancer, Alzheimer’s disease and Huntington’s disease. To help fight this, project Folding@Home sends bits to volunteers’ computers all around the world to distribute the computing power and increase processing efficiency. This came with the PlayStation 3 back in the day.
    • BOINC - University of California, Berkeley:
      • A variety of projects: Mathematics, astrophysics, climate, and Minecraft@Home.
    • There are many other examples of distributed computing in the world, but these are the more popular ones.
CollegeBoard learning objectives: - IOC-1.E.2: Science has been affected by using distributed and "citizen science" to solve scientific problems. - IOC-1.E.3: Citizen science is scientific research conducted in whole or part by distributed individuals, many of whom may not be scientists, who contribute relevant data to research using their own computing devices. - IOC-1.E.5: Human capabilites can be enhanced by collaboration via computing.

Crowdsourcing

Popcorn Hack 2

Q: Based on the examples provided above, what other heavy tasks can distributed computing be used for? Write your answer in the space provided.</font>

Enter your answer here: AI like chatGPT

Innovations made possible with crowdsourcing

  • Spotify:
    • Collaborative playlists
    • User data-based music algorithm (one of the better ones out there)
    • Metadata write-in (write descriptions for songs that are peer-approved - similar to Wikipedia in a way)
  • Crowdfunding:
    • Kickstarter - You can pitch an idea on the platform; people can donate money to help develop your idea or create your product
    • IndieGoGo - More for non-profit, philanthropic programs
    • Patreon - One of the most popular subscription-based funding programs to help creators/developers with their journey
    • GoFundMe - One more example of a popular website where you can donate money for a good cause
  • Blockchain:
    • Mainly used as a cryptocurrency wallet
    • It can be used for securing things like concert tickets as well - done through a decentralized computing network (every computer is doing their part to ensure that each block in the chain is secure relative to the ones before and after it)
CollegeBoard learning objectives: - IOC-1.E.4: Crowdsourcing is the practice of obtaining input or information from a large number of people via the Internet. - IOC-1.E.6: Crowdsourcing offers new models for collaboration, such as connecting businesses or social causes with funding.

Homework - with a twist!

Debate: Ethics of crowdsourcing - Positives and negatives of crowdsourcing</font>

Debate:

  • crowdsourcing can lead to low quality informatiom
  • But google maps takes lots of info and can still provide lots of quality info
  • sometimes you need bad review such as for like resturaunts so that you know whether something is quality or not
  • audiences could be bias and could provide misinformation
  • only people who want to share information will be recorded so it could lead to biased information
  • however, if you take information from a wide population or get lots of data, the bias would cancel eachother out.

After the debate, I remain in my position that corwdsourcing has more pros than cons. the other side made the argument that crowdsourcing can lead to low quality information, misinformation, and biased information but this is largely the case when someone is getting their data from a small group. If you get enough data from a large range of people and groups, you get unbiased data and more data that is helpful. With more people providing info, we can find solutions to different problems and be able to see problems or any other things in lots of different views.