google-scholar-scraper

Python library for scraping Google Scholar.


Keywords
references citations bibliography
License
GPL-2.0+
Install
pip install google-scholar-scraper==0.2

Documentation

google-scholar-scraper

Version 0.2 works with Python3.

Installation

$ pip install google-scholar-scraper

Command-line usage

$ gsscraper "neeman grothendieck duality"

will return the first result from Google Scholar matching this query, in XML format.

$ gsscraper "neeman grothendieck duality" -n 5

will return the five results from Google Scholar matching this query. (Max is 10 results.)

Library usage

import gsscraper

query = "neeman grothendieck duality"
gsscraper.get_result(query) # (a)
gsscraper.get_results(query, 5) # (b)
gsscraper.get_result_as_xml(query) # (c)

Here,

  1. will return a Python dict with keys "title", "author", etc.;
  2. will return a list of such Python dicts;
  3. will return a list of strings in XML format.

License

GPL

Authors

Adeel Khan (@adeel)

Parts of the code are derived from the package gscholar by Bastian Venthur (@venthur).