Data behind the study "FakeNewsNet: A Data Repository with News Content, Social Context and Spatialtemporal Information for Studying Fake News on Social Media" (https://arxiv.org/abs/1809.01286). The news articles in this dataset were posted to Facebook in September 2016, in the run-up to the U.S. presidential election.
A data frame with 150 rows and 30 variables:
The title of the news article
Text of the article
Hyperlink for the article
Authors of the article
Binary variable indicating whether the article presents fake or real news (fake, real)
Number of words in the title
Number of words in the text
Number of characters in the title
Number of characters in the text
Number of words that are all capital letters in the title
Number of words that are all capital letters in the text
Percent of words that are all capital letters in the title
Percent of words that are all capital letters in the text
Number of characters that are exclamation marks in the title
Number of characters that are exclamation marks in the text
Percent of characters that are exclamation marks in the title
Percent of characters that are exclamation marks in the text
Binary variable indicating whether the title of the article includes an exclamation point (TRUE, FALSE)
Percent of words that are associated with anger
Percent of words that are associated with anticipation
Percent of words that are associated with disgust
Percent of words that are associated with fear
Percent of words that are associated with joy
Percent of words that are associated with sadness
Percent of words that are associated with surprise
Percent of words that are associated with trust
Percent of words that have negative sentiment
Percent of words that have positive sentiment
Number of syllables in the text
Number of syllables per word in the text
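The word, character, capital-letter, and exclamation-mark variables above can in principle be recomputed from the raw title and text. The following is a minimal Python sketch of one way to do so, not the procedure used to build the dataset. The function names and dictionary keys are hypothetical and are not the dataset's column names. The sketch assumes that words are whitespace-separated tokens with surrounding punctuation stripped, that character counts include spaces, and that percents are on a 0 to 100 scale; the dataset's own conventions may differ.

```python
def words(s):
    # Assumed tokenizer: split on whitespace, strip surrounding punctuation,
    # and drop tokens that become empty.
    tokens = (t.strip(".,;:!?\"'()[]") for t in s.split())
    return [t for t in tokens if t]


def percent(part, whole):
    # Percent on a 0-100 scale (an assumption); 0.0 when the denominator is 0.
    return 100.0 * part / whole if whole else 0.0


def surface_features(s):
    # Hypothetical helper: count-based features for one string (title or text).
    w = words(s)
    # str.isupper() is True only if the token has at least one cased letter
    # and all cased letters are uppercase, so "NOW" and "I" count, "2016" does not.
    caps = sum(1 for token in w if token.isupper())
    excl = s.count("!")
    return {
        "words": len(w),
        "chars": len(s),  # includes spaces and punctuation (an assumption)
        "caps": caps,
        "caps_percent": percent(caps, len(w)),
        "excl": excl,
        "excl_percent": percent(excl, len(s)),
        "has_excl": excl > 0,
    }


features = surface_features("BREAKING: Senate votes NOW!")
print(features["words"], features["caps"], features["has_excl"])
```

Applying the same helper to an article's title and to its text would give the two parallel sets of variables listed above. Single-letter uppercase words such as "I" count as all-capital words under this sketch, which may or may not match the dataset.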
Shu, K., Mahudeswaran, D., Wang, S., Lee, D. and Liu, H. (2018). FakeNewsNet: A Data Repository with News Content, Social Context and Dynamic Information for Studying Fake News on Social Media. arXiv:1809.01286.