RE: Hacking The Top 100 New STEEM Posts - First Text Mining Analysis for Fun & Profit!

in #money, 8 years ago

Awesome, upvoted. I have a question for you: how did you gather the data? Did you collect it off the stream using something like Piston, or did you use some other technique to crawl or scrape steemit?

Thanks. I used Selenium with XPath selectors.

Using the steemd API instead of scraping HTML will make your code much more robust. We know people use the steemd API and we try not to introduce breaking changes to it, but the HTML structure of steemit.com pages obviously isn't an API and we feel free to change it as needed, when needed.
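For anyone wanting to try the API route, here is a minimal sketch of fetching the newest posts over JSON-RPC instead of scraping HTML. The endpoint URL (`https://api.steemit.com`) and the method name (`condenser_api.get_discussions_by_created`) are assumptions based on the publicly documented node interface and may differ on your node or version; the helper names are made up for illustration.

```python
import json
import urllib.request

# Assumption: a public steemd JSON-RPC endpoint. Substitute your own node.
NODE_URL = "https://api.steemit.com"


def build_request(limit=100, tag=""):
    """Build a JSON-RPC 2.0 payload asking for the newest posts."""
    return {
        "jsonrpc": "2.0",
        # Assumed method name; check your node's API documentation.
        "method": "condenser_api.get_discussions_by_created",
        "params": [{"tag": tag, "limit": limit}],
        "id": 1,
    }


def extract_posts(response):
    """Return (author, title, body) tuples from a decoded JSON-RPC response."""
    if "error" in response:
        raise RuntimeError(response["error"])
    return [(p["author"], p["title"], p["body"])
            for p in response.get("result", [])]


def fetch_new_posts(limit=100, url=NODE_URL):
    """POST the request to the node and return the parsed posts."""
    data = json.dumps(build_request(limit)).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return extract_posts(json.load(resp))
```

Calling `fetch_new_posts(100)` would then give you the text of the 100 newest posts as structured fields, so nothing breaks when the page layout changes.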

Thanks, that was the next step I had in mind, but I wanted to experiment.
