MUGAlyser

MUGAlyser - a script to extract data from Meetup into a MongoDB database


Keywords
Meetup, MUGS, MongoDB, API
License
Other
Install
pip install MUGAlyser==1.0.6a0

Documentation

MUGAlyser

Version 0.8 Beta

Updated audit schema to make it more consistent. Added a utility program to get info (called muginfo.py). Now data is read from a MUGS.py file so no reading of files is required. You can still get a single MUG instance by using the --mug parameter.

Added support for replica set parameters so we can connect to Atlas.

There is now a very basic REST API for some of the functions in webgui.

VERSION : 0.9 beta

MUGAlyser is a python program that uses the Meetup API to extract longitudinal data from the Meetup website for a collection of meetup groups. Its designed to capture MongoDB meetup information but it could be used to collect data on any group of meetups by changing the list of urlnames in the mugs.py file. You can also get specific mugs using the --mug parameter.

It puts the data in a MongoDB database. There are seperate collections for groups, members, past events and upcoming events.

There is also an audit collection which tracks when the data was collected. Each record is associated with a Batch ID which is tracked in the audit collection.

This is beta software. Not all combinations of options have been tested.

JD10Gen-old:mugalyser jdrumgoole$ python mugalyser_main.py -h
usage: mugalyser_main.py [-h] [--database DATABASE] [--host HOST]
                         [--port PORT] [--username USERNAME]
                         [--password PASSWORD] [--replset REPLSET]
                         [--admindb ADMINDB] [--ssl] [--multi] [--verbose]
                         [-v] [--wait WAIT] [--trialrun]
                         [--mugs MUGS [MUGS ...]]
                         [--attendees ATTENDEES [ATTENDEES ...]]
                         [--loglevel LOGLEVEL]

A program to read data from the Meetup API into MongoDB

  Licensed under the AFGPL
  https://www.gnu.org/licenses/agpl-3.0.en.html

  Distributed on an "AS IS" basis without warranties
  or conditions of any kind, either express or implied.

USAGE

optional arguments:
  -h, --help            show this help message and exit
  --database DATABASE   specify the database name [default: MUGS]
  --host HOST           hostname [default: localhost]
  --port PORT           port name [default: 27017]
  --username USERNAME   username to login to database
  --password PASSWORD   password to login to database
  --replset REPLSET     replica set to use [default: ]
  --admindb ADMINDB     Admin database used for authentication [default:
                        admin]
  --ssl                 use SSL for connections
  --multi               use multi-processing
  --verbose             set verbosity level [default: None]
  -v, --version         show program's version number and exit
  --wait WAIT           How long to wait between processing the next parallel
                        MUG request [default: 5]
  --trialrun            Trial run, no updates [default: False]
  --mugs MUGS [MUGS ...]
                        Process MUGs list list mugs by name or use "all"
  --attendees ATTENDEES [ATTENDEES ...]
                        Capture attendees for these groups
  --loglevel LOGLEVEL   Logging level [default: INFO]
JD10Gen-old:mugalyser jdrumgoole$