jutge

Simple functions to read input from Python


Keywords
jutge, org, education, input
License
Apache-2.0
Install
pip install jutge==2.1

Documentation

The jutge package

Build Status Coverage Status Python Versions PyPi Version

Logo

This tiny package offers a simple interface to read input from stdin, files, and other io streams. It was built in order to offer beginners an easy interface to read data in Jutge.org problems. The behaviour of the reading functions mimic C++'s cin input stream.

  1. Installation
  2. Quickstart tutorial
  3. Usage
    1. Reading tokens
    2. Reading lines
    3. Multiple tokens
    4. Specifying the file stream
    5. Basic types
    6. User defined types
  4. Full reference
    1. Interface
    2. EOF handling modes
    3. Extra features
    4. Version
  5. Credits
  6. License

Installation

  • Python3:
    • Install with pip3 install jutge.
    • Upgrade to latest version with pip3 install --upgrade jutge.
    • Uninstall with pip3 uninstall jutge.
  • Python:
    • Install with pip install jutge.
    • Upgrade to latest version with pip install --upgrade jutge.
    • Uninstall with pip uninstall jutge.

Quickstart tutorial

This is a short learn-by-example tutorial, and does not include all the functionality the package offers. For complete details, refer either to the Usage section or to the Full reference.

This package offers a read function that returns the next token in the input. The type of the token must be given as a parameter: read(int), read(float), read(str), read(chr)... In the event no more tokens are available, read returns None.

Except for characters, tokens are separated by whitespace, so that read(str) returns the next word. Whitespace characters cannot be obtained.

Sample program to read two numbers and write their sum:

from jutge import read

x = read(int)
y = read(int)
print(x + y)

The package also includes the function read_many, which reads tokens as long as there are more (of the requested type).

Sample program to compute the sum of a sequence of integers with read_many:

from jutge import read_many

print(sum(read_many(int)))

Equivalent to:

from jutge import read_many

s = 0
for num in read_many(int):
    s += num

print(s)

Supported built-in types are specified here.

Both read and read_many support specifying more than one type at once, as in read(int, chr). More details can be found here.


Sample program to read a character c and a number n and print the character that comes n positions after c (in the ASCII alphabet):

from jutge import read

c, n = read(chr, int)
print(chr(ord(c) + n))


Usage

There are four main functions available to the user: read, read_many, read_line, and read_many_lines. All three of them can be imported at once by using the "import all" syntax:

from jutge import *

read()
read_many()
read_line()

Of course, you can also import the whole jutge namespace and prefix each function name with jutge:

import jutge

jutge.read()
jutge.read_many()
jutge.read_line()

Reading tokens

read

The read function returns the next token in the input. The type of the token must be given as a parameter: read(int), read(float), read(str), read(chr)... Except for characters, tokens are separated by whitespace, so that read(str) returns the next single word. Whitespace characters cannot be obtained.

If no type is specified—i.e., calling read()—it defaults to str.


Example
Sample program to read two numbers and write their maximum:

from jutge import read

x = read(int)
y = read(int)
print(max(x, y))

Input:

    2
            -5

Output: 2


In the event no more tokens are available, read returns None, as default behaviour—see EOF handling modes for more details.


Example

from jutge import read

print(read())
print(read())

Input: hello

Output: None



read_many

read_many is a function that reads tokens as long as it can— i.e., while there still are tokens (of the requested type) to be read. It has the same signature as read; it takes one (or more) types as arguments.

When called it returns a generator that can be iterated through with a for loop; see the examples below.


Example
The following script showcases the behaviour of read_many. The for loop iterates though all integers in the input (until it encounters something that isn't an integer, or there is nothing else to be read) and prints each of those numbers squared.

Finally, a string is read, to show that the rest of the input is still available to be read.

from jutge import read_many, read

for num in read_many(int):
    print(num**2)

print(read())  # Should print 'hello'

Input:

    2 3
    
4
5
    -1

12 hello

goodbye

Output:

4
9
16
25
1
144
hello

Since read_many returns a generator, it does not consume all the input beforehand; rather, each token is fetched "on-demand" at each iteration. Thus, the loop can be exited at any time. See the following example.


Example

from jutge import read_many, read

print("Try to guess my number!")
for num in read_many(int):
    if num == 42:
        print("Correct! You guessed it!")
        break
    print("{} is incorrect!".format(num))

x, y = read(int), read(int)
print("{} + {} = {}".format(x, y, x + y))  # Should print '5 + 2 = 7'

Input:

7
3
11
5
81
42
5
2

Output:

Try to guess my number!
7 is incorrect!
3 is incorrect!
11 is incorrect!
5 is incorrect!
81 is incorrect!
Correct! You guessed it!
5 + 2 = 7

Example

Get the sum of a sequence of integers in the input.

from jutge import read_many

print(sum(read_many(float)))

Input:

3.14 6.28 0.333
12 100 -51

Output: 70.753


Keyword arguments

amount

Both read and read_many take the optional keyword-argument amount. It specifies the amount of times to "concatenate" the requested types. I.e., read(int, amount=4) is equivalent to read(int, int, int, int).

It is useful when you need to read a large (but fixed) number of tokens, as in read(int, amount=1000).

It is compatible with heterogeneous types. I.e., read(str, int, amount=2) is equivalent to read(str, int, str, int).


as_list

By default, when read is called with multiple types, it returns a list with the requested tokens. If, for some reason, you don't need implicit conversion to a list, read can return a generator instead. This is achieved by passing as_list=False, as in read(float, amount=10000, as_list=False).

This has the advantage of avoiding unnecessary iteration to create the list and copy the tokens into it. For example, you could use it to create a large numpy array:

import numpy as np
from jutge import read

arr = np.array(read(float, amount=1000000, as_list=False))

In the previous block of code, only one iteration through all of the tokens takes place—the one to fill the np.array— whilst if as_list were set to False, two iterations would take place: one to fill the list, and another to fill the array.

Keep in mind that the generator returned by read when as_list is set to True does not consume any input until you iterate through it. This is the reason for which read_many does not support this keyword argument: it must consume input in order to discern whether it should keep going or not, so it always returns multiple-token queries as lists.


file

See Specifying the file stream.


Reading lines

read_line

This function takes no positional arguments. It returns the next line in the input stream. By default, trailing newline characters (CR, LF, or CRLF) are stripped (see the rstrip keyword argument).

Note that in this case lines that only contain whitespace (other than the trailing newline), such as ' \t \n', will be considered "non-empty", and thus returned. If you don't care about whitespace, you should maybe be looking at reading individual tokens with read.


Example

from jutge import read, read_line

print(read())
line = read_line()
print(line, end='')
print(repr(line))

Input

Hi, I'm a line
Hello world!
foo

Output

Hi,
Hello world!
'Hello world!\n'


read_many_lines

The behaviour of this function with respect to read_line is analogous to the behaviour of read_many with respect to read; it yields lines as long as there are more.


Keyword arguments

rstrip

Both read_line and read_many_lines allow this keyword argument. When set to True (which it is by default), trailing newline characters are stripped by calling .rstrip('\r\n').

skip_empty

Both read_line and read_many_lines allow this keyword argument. When set to True (by default it is set to False, empty lines (i.e. lines that only contain whitespace) are skipped, and returned lines are stripped.


Multiple tokens

Both read and read_many support specifying more than one type, as in read(int, chr).

In read's case, this usage returns an iterable sequence—see here for more details—of tokens, one for each type requested, respecting their order. For example, read(int, chr, int) will return a sequence containing an int, a chr, and an int, in that order.


Example
Reading more than one token at once:

from jutge import read

x, y = read(chr, int)
print(repr(x), type(x))
print(repr(y), type(y))

z, w = read(int, int)
print(z*w)

Input: D3 4 2

Output:

'D' <class 'str'>
3 <class 'int'>
8

In this regard, read_many behaves exactly like before. Each iteration through the generator yields the equivalent of what would be obtained by calling read with the same arguments.

That is, the following two loops are (almost) equivalent.

from jutge import read_many

for char, num in read_many(chr, int):
    pass  # Code goes here
from jutge import read

char, num = read(chr, int)
while char is not None and num is not None:
    pass  # Code goes here
    char, num = read(chr, int)

The only difference is that read_many will automatically stop when subsequent input does not match the desired types, whilst the second, "manual", loop, will stop with a ValueError.

Note: be careful when using read_many with heterogeneous types. It will stop when it encounters input that don't match the types, but those tokens will be "consumed", since right now there is no backtracking implemented.


Specifying the file stream

All four functions accept a file keyword argument that specifies the file stream to read from, as a file object or other io stream object. It defaults to the standard input stream (as defined by the input() built-in).


Example
Read each number from an open file:

from jutge import read_many

with open('file.txt') as f:
    for num in read_many(int, file=f):
        print(num)

Input (file.txt):

1   2

  3     4
 
5
6

Output:

1
2
3
4
5
6

One can use io.StringIO to read from strings by converting them into streams, as the following example shows.


Example

import io
from jutge import read

string = "21 40\t"
stream = io.StringIO(string)
x, y = read(int, int, file=stream)
print(x + y)

Output: 41


Note: In Python 2.x, one must use unicode literals to be able to use io.StringIO, by including the line

from __future__ import unicode_literals

at the top of the module, or by writing string literals with the 'u' prefix: u"I'm a unicode string".


Reading from an interactive console session

As of version 2.0, the standard input stream has been made to work with the input() built-in, which means everything also works in interactive console sessions.

Keep in mind that to signal end of input you will have to enter control + d on Linux and Mac or control + z on Windows.


Basic types

The read function supports the following basic built-in types:

  • integer (int),
  • floating point (float),
  • character (chr),
  • string (str).

User defined types

Any type whose constructor accepts a string is also supported; for example read(iter) will return a string iterator.


Example
Read an instance of a custom class whose constructor accepts a string, and saves an uppercase copy of the string:

from jutge import read

class MyType:
    def __init__(self, word): 
        self.word = word.upper()
    
    def say_word(self): 
        print(self.word)

a = read(MyType)  # a = MyType(<input string>)
a.say_word()

Input: hello

Output: HELLO



Full reference

Interface

  • (function) jutge.read(*types, amount=1, file=_StdIn, as_list=True)
    Returns tokens from the input stream specified by file. _StdIn is an alias for standard input.

    Parameters:

    • *types: callable
      For each type given in types, a token is returned converted to the specified type, in the given order.

    • amount: int
      Its effect is to repeat the sequence of types by the specified amount. That is, read(*types, amount=n) is equivalent to read(*(types*n)), where types is a tuple of types and n is an integer—but the former is more efficient.

    • file
      A file object or other iterable input stream object (e.g. io.StringIO) that specifies where to read input from.

    • as_list: bool
      If True, if more than one token is requested, converted tokens are returned as a list, and thus input is immediately consumed. Otherwise, a generator is returned (and no input is consumed).

  • (function) jutge.read_many(*types, amount=1, file=_StdIn)
    Returns a generator that yields tokens from the input stream as long as there are more tokens to be read of the requested type. Each next call to this generator yields the result of read(*types, amount=amount, file=file, as_list=True). _StdIn is an alias for standard input.

    Parameters: refer to the documentation for read's parameters

  • (function) jutge.read_line(*, file=_StdIn, rstrip=True, skip_empty=True)
    Returns the next line in the input stream specified by file. _StdIn is an alias for standard input. Lines are given raw (without stripping). By default, trailing newlines (CR, LF, or CRLF) are stripped (see the rstrip keyword argument).

    Parameters:

    • file
      A file object or other iterable input stream object (e.g. io.StringIO) that specifies where to read input from.

    • rstrip: bool
      If set to True, trailing newline characters are stripped by calling .rstrip('\r\n'). Note that if you intend to strip the whole line anyways, this may cause overhead since strings are immutable, and thus stripping or "modifying" the string requires making a copy; that is, if you intend to call .strip() on the result, set rstrip to False.

    • skip_empty: bool
      If true, strictly empty lines (i.e. lines with just the newline character) will be skipped, and returned lines will be stripped.

  • (function) jutge.read_many_lines(*, file=_StdIn, rstrip=True, skip_empty=True)
    Yields lines from the input stream as long as there are more to be read. Returns a generator object when called.

    Parameters: refer to the documentation for read_line's parameters

  • (function) jutge.set_eof_handling(mode)
    Sets the global end of input handling mode to mode (a member of the jutge.EOFModes enum). See [EOF handling modes](EOF handling modes) for more information.

    Parameters:

    • mode: jutge.EOFModes
      Handling mode to set.
  • (enum.Enum) jutge.EOFModes
    Enumeration class with the different EOF handling modes. See [EOF handling modes](EOF handling modes) for more information.

    Members:

    • RaiseException
      When active, this handling mode enables the raising of an EOFError exception when end of input is reached.

    • ReturnNone
      When active (default), this handling mode makes functions like read return None as tokens when the end of input is reached.


EOF handling modes

The behaviour of the package's functions when encountering EOF (the end of input) can be customized with set_eof_handling, by passing one of the members of the EOFModes enum. Currently, the enum holds two options:

  • ReturnNone
  • RaiseException

When ReturnNone is set, a None object is returned for each token that is requested after end-of-input. When RaiseException is set, an EOFError exception is raised instead.

The first mode may seem more simple because it does not involve any exceptions. But the second mode is actually much easier to debug, since if you read more than you thought you would have to read (i.e., beyond the end of the input), that probably means something is wrong with your program, and thus halting the program immediately and showing a traceback is much more beneficial than gobbling up the error in silence and potentially passing on the error to some other unrelated part of your code.

As a side matter, read_many always functions in RaiseException mode, since it needs to catch the error to shut down properly. There is a way to avoid doing this, but it involves checking for None objects in the output tokens, which is much more cumbersome.

For backwards compatibility reasons, ReturnNone is the default.


Example
Changing the EOF mode to RaiseException:

from jutge import *

set_eof_handling(EOFModes.RaiseException)


Extra features

Recursion limit

When importing the jutge package, the maximum depth of the Python interpreter stack is increased (using sys.setrecursionlimit(1000000)). This feature helps solving some recursive problems in Jutge.org.


Version

The variable jutge.version contains the version of the package.


Credits


License

Apache License 2.0