Sentiment data starts out as plain social media posts aggregated across all the major social media platforms. It is then run through one or many machine learning models, typically NLP models, that try to understand what is being said in the social media post. You can then aggregate this data