Happy new year everybody!
Everyone knows Google, right? They have a huge data-center that stores all website around the world. How can they do it? With a bot spider! Very intelligent spider. And we gonna made it, but simpler, and for our own purpose.
We don't have to defeat Google or competitive with them, we just learn from them to create ours.
For my reason, I want to create a bot, a spider or a crawler that can help me collect all articles from some e-news websites, some blogs, and from Google caches (for website is dead). Cut out the content and put it in my database (for my website, for my other activities, etc).
I hope you come her with same purpose as me, or have the same attitude to learn how crawler work. I'm really want to share and learn about new thing with other people. And I'm always open for your contribute, your comment and feedback to improve other skill.
Feel free to join group, feel free to prove me wrong (both coding and English). And I'm welcome you to be co-founder of this group too.
P/S: I'm new to python, so I don't know much, but I will try to post 1 tutorial per week around topic crawler and scrapping.
A few worlds about myself:
I'm Thang, a web developer and designer (*I can use photoshop and html/css too*), and I'm 24 y/o (*so you can call me bro, dude, noob if you want*). I used to work as freelancer, now I'm build e-com website for my friend. And I will find a new job on June, this year.