Popular Categories

Using Scrapy to scrape MITOPENCOURSEWARE site for syllabi

Course Queries Syllabus Queries 3 years ago

5K 1 0 0 0

Manpreet Singh

Previous Next

User submissions are the sole responsibility of contributors, with TuteeHUB disclaiming liability for accuracy, copyrights, or consequences of use; content is for informational purposes only and not professional advice.

Answers (1)

Post Answer

manpreet Best Answer 3 years ago

My spider to scrape the MITOPENCOURSEWARE site for syllabi doesn't work. Will someone please help me figure out what's wrong with it? The .*'s are to get to all the courses. Is this right?

  1 from scrapy.contrib.spiders import CrawlSpider, Rule
  2 from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
  3 from scrapy.selector import HtmlXPathSelector
  4 from opensyllabi.items import OpensyllabiItem
  5 
  6 class MITSpider(CrawlSpider):
  7     name = 'mit'
  8     allowed_domains = ['ocw.mit.edu']
  9     start_urls = ['http://ocw.mit.edu/courses']
 10     rules = [Rule(SgmlLinkExtractor(allow=['/.*/.*/syllabus']), 'parse_syllabus')]
 11 
 12     def parse_syllabus(self, response):
 13         x = HtmlXPathSelector(response)
 14 
 15         syllabus = OpensyllabiItem()
 16         syllabus['url'] = response.url
 17         syllabus['body'] = x.select("//div[@id='course_inner_section']").extract()
 18         return syllabus

0 views

0 shares

No matter what stage you're at in your education or career, TuteeHUB will help you reach the next level that you're aiming for. Simply,Choose a subject/topic and get started in self-paced practice sessions to improve your knowledge and scores.

Popular Categories

Using Scrapy to scrape MITOPENCOURSEWARE site for syllabi

Manpreet Singh

Answers (1)

manpreet Best Answer 3 years ago

Similar Forum

Neet 2019 syllabus will change?

Syllabus in LaTeX

Does Hogwarts follow any specific syllabus for DADA?

Explore Other Libraries

Online Exams

Question Bank

Career News

Feeds

Full Forms

Dictionary

Interview Question

Gigs

Quotes

Lyrics

Videos

Courses

Blogs

Tutorials

Forum

Educators

Corporates

Tools

Related Searches

Important Course Queries Links

Join Our Community Today