The numbers differ because Twitter occasionally includes duplicates (the exact same tweet) in the data. To keep the script running quickly these aren’t removed, but there is a TAGS menu option to remove duplicates, or you can use other tools to clean the data.
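
If you would rather clean the data outside the spreadsheet, here is a rough sketch in Python of one way to drop repeated tweets from an exported CSV. It assumes each row has a tweet ID column named `id_str`; check the header of your own archive sheet and change the `key` argument if yours differs. The function name `drop_duplicate_tweets` and the sample rows are made up for illustration.

```python
import csv
import io


def drop_duplicate_tweets(rows, key="id_str"):
    """Return rows with repeated tweet IDs removed, keeping the first occurrence.

    `key` is the column holding the tweet ID (assumed here to be "id_str").
    """
    seen = set()
    unique = []
    for row in rows:
        tweet_id = row[key]
        if tweet_id in seen:
            continue  # same tweet already kept, skip this copy
        seen.add(tweet_id)
        unique.append(row)
    return unique


# Made-up sample standing in for a CSV export of the archive sheet.
sample = io.StringIO(
    "id_str,from_user,text\n"
    "1001,alice,hello\n"
    "1002,bob,hi\n"
    "1001,alice,hello\n"
)

rows = list(csv.DictReader(sample))
cleaned = drop_duplicate_tweets(rows)
print(len(rows), "rows before,", len(cleaned), "rows after")
```

To use it on a real export, replace the `io.StringIO` sample with `open("your_export.csv", newline="", encoding="utf-8")`. Matching on the ID rather than the tweet text means retweets or people posting identical wording are still counted separately.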