Navy Sources Sought: Global Historical Archive of Social Media Data

Notice ID:  1301177121

This project is part of ongoing research efforts conducted through the Department of Defense and Analysis at the Naval Postgraduate School.  Our research aims to provide enhanced understanding of fundamental social dynamics, to model the dynamics of influence in the information environment, over time and across countries.

Scope:  To support this research, we seek to acquire a global historical archive of social media data, providing the full text of all available public online media posts, or an unbiased sample of all available public online media posts, across all countries and languages, across multiple social media platforms.

The contractor shall be responsible for providing the following deliverables:

  • Record Requirements
  • Platforms Covered: The archive must provide records from at least 3 global social media platforms, each with at least 200 million unique registered users per platform.
  • Minimum Time Period Covered: 6 months. Time period covered by the archive must begin no later than 10/1/2021 and must end no sooner than 3/31/2022.
  • Minimum Number of Total Records: 6 billion
  • Sampling rate: The archive must provide at least 5% of the total volume of publicly available messages from each of the sampled platforms.
  • Sampling mechanism: If less than 100% of publicly available messages from a given platform will be included, proposals must include a description of how sampling will be performed, including whether sampling was conducted fully randomly, or through some other means, and must confirm that the sampling does not rely on any systematic filtering or exclusions based on message content, search terms, language, or geography.
  • Each record in the archive must provide the full text of an online social media post, unaltered from its original content and formatting, with all publicly available meta-data, including country, language, hashtags, location, handle, timestamp, and URLs, that were associated with the original posting. Proposals must include a full listing of the fields that will be included from each platform.
  • Approximate location information, such as self-reported user hometowns, or other publicly available geolocation information, must be included for at least 20% of the records.
  • All records must consist exclusively of publicly available information. No private communications or private user data will be included.
  • All data must be collected and delivered in compliance with applicable regulations, including the GDPR and CCPA, and in compliance with the Terms of Service of the recorded platforms.

Read more here.

Ad



Not Yet a Premium Partner/Sponsor? Learn more about the OS AI Premium Corporate and Individual Plans here. Plans start at $250 annually.

How useful was this post?

Click on a star to rate it!

We are sorry that this post was not useful for you!

Let us improve this post!

Tell us how we can improve this post?

LEAVE A REPLY

Please enter your comment!
Please enter your name here