rbnf-rts

Runtime support for generated parsers of RBNF.hs

P.S:

rbnf-rts is the new fastest Python parser generator by 2020, according to a benchmark given by benchmark-json.

Features

Syntax-driven
Left recursion with LL parsing(yes, left recur in LL, it's right)
Grammar inline optimizations
Smarter lookahead via ID3 algorithm
Statically generated(no need to create the parser again and again when starting your application)
Good error reports(including the position and nested rule names of parsing error)
RBNF.hs is language independent
[WIP]: context sensitive parsing as an extension to CFG

More Examples

Check the test directory:

test
- multiply : parser/lexer implementation for multiplications
- arith : parser/lexer implementation for arithmetics
- relax : parser/lexer implementation for a full-featured programming language
- llvmir: parser/lexer implementation for LLVM IR, nearly full-featured
- rbnfjson: a json parser implemented by RBNF.hs and rbnf-rts

In each sub-directory of test, you can run tests via directly invoking the test.sh., like cd test/llvmir && bash test.sh

Native Dependencies

The Haskell Stack Toolchain

The executable rbnf-pgen in PATH.

If ~/.local/bin is already in PATH:

git clone https://github.com/thautwarm/RBNF.hs
cd RBNF.hs
stack install .

Example: Multiplications

write a multiply.exrbnf file

# 'mul' is a python global which should be marked as 'required' in .rlex
Mul    : lhs=Mul "*" rhs=Atom { mul(lhs, rhs) }
       | a=Atom  {a}
       ;

# 'unwrap' should be marked as 'required', just as 'mul'
Atom   :  "(" !a=Mul ")"  {a} ;
       |  <number>        { unwrap($0) };
START ::= <BOF> Mul <EOF> { $1 };

or write a multiply.rbnf file:

# 'mul' is a python global which should be marked as 'required' in .rlex
Mul    : !lhs=Mul "*" !rhs=Atom            -> mul(lhs, rhs);
Mul    : !a=Atom                           -> a;
Atom   : "(" !a=Mul ")"                    -> a;

# 'unwrap' should be marked as 'required', just as 'mul'
Atom   : !a=<number>                       -> unwrap(a);
START ::= <BOF> !a=Mul <EOF>               -> a;

write a multiply.rlex file:

%require mul
%require unwrap
%ignore space
number [+-]?\d+
space \s+

codegen

sh> rbnf-pygen multiply.rbnf multiply.rlex multiply.py --k 1 --traceback

run statically-generated parsers and lexers and enjoy its efficiency

from rbnf_rts.rts import Tokens, State
from multiply import run_lexer, mk_parser
import operator

def unwrap(x: Token):
    return int(x.value)

scope = dict(mul=operator.mul, unwrap=unwrap)

parse = mk_parser(**scope)


tokens = list(run_lexer("<current file>", "-1 * 2 * (3 * 4)"))
got = parse(State(), Tokens(tokens))
print(got)

Got (True, -24), where True indicates the parsing succeeded.

If False, a list of errors will be given in the second element of the return tuple.

Menhir-like syntax sugars including list and separated_list.

A json parser more than 20% efficient than that of lark-parser(lol), written as a .exrbnf file.

START: <BOF> value <EOF> { $1 };

value: <ESCAPED_STRING> { DQString(*$0) }
     | <SIGNED_INT>     { int(*$0) }
     | <SIGNED_FLOAT>   { float(*$0) }
     | "true" { True }
     | "null" { None }
     | "false" { False }
     # array
     | '[' ']' {[]}
     | '[' separated_list(',', value) ']' { $1 }
     # object
     | '{' '}' { dict() }
     | '{' separated_list(',', pair) '}' { dict($1) }
     ;
pair   : <ESCAPED_STRING> ":" value { (DQString(*$0), $2) };

Check rbnfjson for more information.

rbnf-rts
Release 1.0

Release 1.0

1.0

0.7

0.6.1

0.6

0.5.1

0.5

0.42

0.41

0.4

0.3

Documentation

rbnf-rts

Features

More Examples

Native Dependencies

Example: Multiplications

Stats

Development practices

Releases

Contributors

rbnf-rts Release 1.0

Release 1.0 Toggle Dropdown 1.0 0.7 0.6.1 0.6 0.5.1 0.5 0.42 0.41 0.4 0.3

Documentation

rbnf-rts

Features

More Examples

Native Dependencies

Example: Multiplications

Stats

Development practices

Releases

Contributors

rbnf-rts
Release 1.0

Release 1.0

1.0

0.7

0.6.1

0.6

0.5.1

0.5

0.42

0.41

0.4

0.3