Search Results

2018 Texas Senate Debate Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz, captured around their first debate on September 21, 2018. The dataset was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward

Corpus of News on the Web (NOW) - April 2018

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: April 2018
Creator: Davies, Mark

Corpus of News on the Web (NOW) - March 2018

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: March 2018
Creator: Davies, Mark

Corpus of News on the Web (NOW) - May 2018

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: May 2018
Creator: Davies, Mark

Gaming Census Dataset

Description: This dataset represents survey feedback gathered about games in libraries, collections, cataloging, outreach, and programming.
Date: December 3, 2018
Creator: Brannon, Sian; Robson, Diane & Dewitt-Miller, Erin

Hurricane Florence Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to Hurricane Florence and the subsequent flooding along the Carolina coastal region. The dataset was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 4,971,575 Tweets and 347,205 media files make up the combined dataset.
Date: 2018-09-05/2018-10-03
Creator: Phillips, Mark Edward

Labeled PDF Dataset from End of Term (EOT) 2008 Web Archive

Description: This dataset contains a random sample of 2,000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples were categorized as being of interest for possible inclusion in the Technical Report Archive and Image Library (TRAIL). Each PDF has been sorted into one of two categories, Technical_Report or Not_Technical_Report.
Date: July 2018
Creator: Kirkwood, Patricia; Phillips, Mark Edward & Caldwell, Christopher

University of North Texas Libraries Serials Transparency List

Description: Dataset containing information regarding subscriptions purchased by UNT Libraries, along with pricing information for the 2013-14, 2014-15, and 2015-16 fiscal years.
Date: April 2018
Creator: University of North Texas. Libraries. Collection Development.

UNT Scholarly Works PDF Dataset

Description: This dataset contains a set of 4,534 PDF files from the UNT Scholarly Works collection, the institutional repository for UNT in the UNT Digital Library.
Date: September 12, 2018
Creator: Phillips, Mark Edward