Classification and Analysis of Personal and Commercial CBD Tweets

Jason S. Turner, Mehmed M. Kantardzic, Rachel Vickers-Smith

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review


This study analyzes the differences in terms regarding cannabidiol (CBD) expressed by commercial sellers and personal users on Twitter. It demonstrates that data from social networks can be used by public health and medical researchers to compare the medical conditions targeted by those selling loosely-regulated substances such as CBD against the medical conditions that patients themselves are commonly treating with CBD. We collected 567,850 tweets by searching Twitter with the Tweepy Python package using the terms CBD and cannabidiol, and annotated a sample of 5,496 tweets to distinguish between personal use CBD tweets and commercial/sales-related CBD tweets. We used this sample to train two binary text classifiers to create two corpora of 169,876 personal use and 148,866 commercial/sales. Using medical, standard, and slang dictionaries, we then identified and compared the most frequently occurring medical conditions, symptoms, side effects, body parts, and other substances referenced in both corpora.

Original languageEnglish
Title of host publicationMediterranean Forum – Data Science Conference - First International Conference, MeFDATA 2020, Revised Selected Papers
EditorsJasminka Hasic Telalovic, Mehmed Kantardzic
Number of pages12
StatePublished - 2021
Event1st Mediterranean Forum - Data Science Conference, MeFDATA 2020 - Virtual, Online
Duration: Oct 24 2020Oct 24 2020

Publication series

NameCommunications in Computer and Information Science
Volume1343 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937


Conference1st Mediterranean Forum - Data Science Conference, MeFDATA 2020
CityVirtual, Online

Bibliographical note

Publisher Copyright:
© 2021, Springer Nature Switzerland AG.


  • Cannabis
  • Text classification
  • Text mining

ASJC Scopus subject areas

  • General Computer Science
  • General Mathematics


Dive into the research topics of 'Classification and Analysis of Personal and Commercial CBD Tweets'. Together they form a unique fingerprint.

Cite this