Happy New Year, everybody!
Everyone knows Google, right? They have huge data centers that index websites from all around the world. How do they do it? With a spider bot! A very intelligent spider. And we're going to build one too, but simpler, and for our own purposes.
We don't have to beat Google or compete with them; we just learn from them to create our own.
As for me, I want to create a bot (a spider, or crawler) that can help me collect articles from some e-news websites, some blogs, and Google's cache (for websites that are dead). It will cut out the content and put it in my database (for my website, my other activities, etc.).
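To make the "cut out the content and put it in my database" step a bit more concrete, here is a minimal sketch using only Python's standard library. It is not a finished crawler: it assumes the article text lives in `<p>` tags, and the names `ParagraphExtractor`, `extract_article_text`, `save_article`, and the `articles` table are made up for illustration. Downloading the page is left out; the HTML is passed in as a string.

```python
import sqlite3
from html.parser import HTMLParser


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside <p> tags (assumption: that is the article body)."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []
        self._in_p = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._in_p = True
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "p" and self._in_p:
            # Join the pieces and collapse runs of whitespace.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.paragraphs.append(text)
            self._in_p = False

    def handle_data(self, data):
        if self._in_p:
            self._buffer.append(data)


def extract_article_text(html):
    """Return the paragraph text of an HTML page, one paragraph per line."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.paragraphs)


def save_article(conn, url, html):
    """Store the extracted text in a hypothetical 'articles' table, keyed by URL."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS articles (url TEXT PRIMARY KEY, body TEXT)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO articles (url, body) VALUES (?, ?)",
        (url, extract_article_text(html)),
    )
    conn.commit()
```

To try it, open a database with `sqlite3.connect("articles.db")` (or `":memory:"` for a throwaway one) and call `save_article(conn, url, html)` with a page you have already downloaded. Real news sites and blogs each have their own markup, so the "everything in `<p>`" rule is only a starting point.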
I hope you came here with the same purpose as me, or with the same eagerness to learn how crawlers work. I really want to share and learn new things with other people. And I'm always open to your contributions, comments, and feedback so we can improve our skills.
Feel free to join the group, and feel free to prove me wrong (in both coding and English). You're also welcome to be a co-founder of this group.
P/S: I'm new to Python, so I don't know much, but I will try to post one tutorial per week on crawling and scraping.
A few words about myself:
I'm Thang, a web developer and designer (*I can use Photoshop and HTML/CSS too*), and I'm 24 y/o (*so you can call me bro, dude, or noob if you want*). I used to work as a freelancer; now I'm building an e-commerce website for my friend. And I will look for a new job in June this year.
Haha no way... My very first SETT post was about writing a scraper in Perl and using wget. Small world, although it's probably more likely that the people who read Tynan's blog, and thus are likely to use SETT, are all very similar...
So back in January, I wrote out my 7 goals for the year. It's been two months, so let's see how I'm doing:
1. Become FULLY polyphasic
I'm close on this one. Many days go perfectly; sometimes, if I have nothing to do, I oversleep and then skip some naps during the day. I'm actually pretty satisfied with that, as I'm only sleeping 2.5-4.5 hours per night, I'm never tired, and can always count on being awake early and staying up late. I'll keep pressing to be more consistent, but I'm satisfied with where I am.
I still live in the RV. People ask how long I'm going to stay here and honestly I don't know. I love it so much that I don't even want to leave. I DO want a solar panel, though, which gets installed Thursday. That's exciting. Did I already mention that I had 5 people over to play cards? That's six people in the tiny RV including me, which is my personal record. Soon I will have a house party.
I learned (the basics of) PHP, MySQL, and AJAX in the past week. They have such scary names that I assumed they would be really difficult to learn, but in fact it's super easy. I'm making a quiz site (like those MySpace quizzes), and I already have most of it done. It's even fancy and ajaxy. If you're a lady, pretend that this last paragraph was about me saving kids from a fire, and not about nerdy stuff. Thanks.
My eyes are getting better and better from PRK. Still not totally recovered, but almost certainly 20/20. We go in for a checkup in a few days, so we'll see what the doctor says.