Extract date from tweets (Tweepy, Python)

I'm new to Python and struggling a bit with this. The code below gets the text of tweets containing the hashtag #bitcoin, and I want to extract the date and author as well as the text. I've tried different things, but I'm stuck right now. I'd greatly appreciate any help with this.

import pandas as pd
import numpy as np
import tweepy

api_key = '*'
api_secret_key = '*'
access_token = '*'
access_token_secret = '*'

# Use the key names defined above and the same handler object throughout
authentication = tweepy.OAuthHandler(api_key, api_secret_key)
authentication.set_access_token(access_token, access_token_secret)
api = tweepy.API(authentication, wait_on_rate_limit=True)

#Get tweets about Bitcoin and filter out any retweets
search_term = '#bitcoin -filter:retweets'
tweets = tweepy.Cursor(api.search_tweets, q=search_term, lang='en', since='2018-11-01', tweet_mode='extended').items(50)
all_tweets = [tweet.full_text for tweet in tweets]


df = pd.DataFrame(all_tweets, columns=['Tweets'])
df.head()

If you use dir(tweet), you can see all the attributes and methods of the tweet object:

author
contributors
coordinates
created_at
destroy
display_text_range
entities
extended_entities
favorite
favorite_count
favorited
full_text
geo
id
id_str
in_reply_to_screen_name
in_reply_to_status_id
in_reply_to_status_id_str
in_reply_to_user_id
in_reply_to_user_id_str
is_quote_status
lang
metadata
parse
parse_list
place
possibly_sensitive
retweet
retweet_count
retweeted
retweets
source
source_url
truncated
user
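
A listing like the one above can be produced by filtering out the underscore-prefixed names that dir() also returns. This is only a sketch: Demo is a hypothetical stand-in for a tweet object (it is not a Tweepy class), used so the snippet runs without API credentials, and public_names is an illustrative helper.

```python
class Demo:
    """Hypothetical stand-in for a tweet object (not a Tweepy class)."""

    def __init__(self):
        self.full_text = "hello"
        self.created_at = "2022-03-26"
        self._json = {}  # private attribute, filtered out below


def public_names(obj):
    """Return the names from dir(obj) that do not start with an underscore."""
    return [name for name in dir(obj) if not name.startswith("_")]


names = public_names(Demo())
print("\n".join(names))
```

Inside the real loop, public_names(tweet) should give the same kind of list for an actual tweet.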

Among them is created_at, so you can collect it together with the text:

all_tweets = []

for tweet in tweets:
    #print('\n'.join(dir(tweet)))
    all_tweets.append( [tweet.full_text, tweet.created_at] )

df = pd.DataFrame(all_tweets, columns=['Tweets', 'Created At'])
df.head()

Result:

                                           Tweets                Created At
0  @Ralvero Of course $KAWA ready for 100x 🚀#ETH ... 2022-03-26 13:51:06+00:00
1  Pairs:1INCHUSDT \n SELL:1.58500\n Time :3/26/2...  2022-03-26 13:51:06+00:00
2  @hotcrosscom @iSafePal 🌐 First LIVE Dapp: Cylu... 2022-03-26 13:51:04+00:00
3  @Justdoitalex @Isabel_Schnabel Finally a truth...  2022-03-26 13:51:03+00:00
4  #Bitcoin has rejected for the fourth time the ...  2022-03-26 13:50:55+00:00
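
Since the question also asks for the author, the same loop can read the screen name from tweet.user. This is a minimal sketch, assuming the tweet objects expose user.screen_name (the user entry in the dir() listing points at that object). The SimpleNamespace objects and the tweet_rows helper are hypothetical stand-ins so the snippet runs without credentials.

```python
from datetime import datetime, timezone
from types import SimpleNamespace


def tweet_rows(tweets):
    """Build one [text, created_at, author] row per tweet."""
    return [[t.full_text, t.created_at, t.user.screen_name] for t in tweets]


# Hypothetical stand-ins for the objects yielded by tweepy.Cursor(...).items()
fake_tweets = [
    SimpleNamespace(
        full_text="#Bitcoin example tweet",
        created_at=datetime(2022, 3, 26, 13, 51, 6, tzinfo=timezone.utc),
        user=SimpleNamespace(screen_name="example_user"),
    ),
]

rows = tweet_rows(fake_tweets)
print(rows[0][2])
```

With the real tweets iterator, pd.DataFrame(tweet_rows(tweets), columns=['Tweets', 'Created At', 'Author']) should give a three-column frame.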

However, your code has a problem with since: it seems this parameter was removed in version 3.8.

See: "Collect tweets in a specific time period in Tweepy, until and since doesn't work"
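
If your Tweepy version does not accept since, one possible workaround is to filter on created_at after fetching. This is a minimal sketch, assuming created_at is a timezone-aware datetime as in the result above. tweets_since is a hypothetical helper and the SimpleNamespace objects are stand-ins for real tweets. It only narrows what the search already returned and does not make the API return older tweets.

```python
from datetime import datetime, timezone
from types import SimpleNamespace


def tweets_since(tweets, start):
    """Keep only tweets created at or after `start` (timezone-aware)."""
    return [t for t in tweets if t.created_at >= start]


# Hypothetical stand-ins for tweets returned by the search
fake_tweets = [
    SimpleNamespace(full_text="old", created_at=datetime(2018, 10, 31, tzinfo=timezone.utc)),
    SimpleNamespace(full_text="new", created_at=datetime(2022, 3, 26, tzinfo=timezone.utc)),
]

recent = tweets_since(fake_tweets, datetime(2018, 11, 1, tzinfo=timezone.utc))
print([t.full_text for t in recent])
```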
