Popularity
0.4
Growing
Activity
-
-
15
3
3

Programming language: - - -
Tags: Misc    

parser4k alternatives and similar libraries

Based on the "Misc" category.
Alternatively, view parser4k alternatives based on common mentions on social networks and blogs.

Do you think we are missing an alternative of parser4k or a related project?

Add another 'Misc' Library

README

Parser4k

Recursive descent parser combinator library for Kotlin with support for left recursion. It aims to be:

  • simple - very few core concepts
  • easy to use - you can quickly figure out how to write a parser for a small language
  • production-ready - enough functionality and performance for real-world applications

Examples

object MinimalCalculator {
    val cache = OutputCache<BigDecimal>()
    fun binaryExpr(s: String) = inOrder(ref { expr }, token(s), ref { expr })

    val number = oneOrMore(oneOf('0'..'9')).map { it.joinToString("").toBigDecimal() }.with(cache)
    val paren = inOrder(token("("), ref { expr }, token(")")).skipWrapper().with(cache)

    val power = binaryExpr("^").map { (l, _, r) -> l.pow(r.toInt()) }.with(cache)
    val divide = binaryExpr("/").mapLeftAssoc { (l, _, r) -> l.divide(r) }.with(cache)
    val multiply = binaryExpr("*").mapLeftAssoc { (l, _, r) -> l * r }.with(cache)

    val minus = binaryExpr("-").mapLeftAssoc { (l, _, r) -> l - r }.with(cache)
    val plus = binaryExpr("+").mapLeftAssoc { (l, _, r) -> l + r }.with(cache)

    val expr: Parser<BigDecimal> = oneOfWithPrecedence(
        oneOf(plus, minus),
        oneOf(multiply, divide),
        power,
        paren.nestedPrecedence(),
        number
    ).reset(cache)

    fun evaluate(s: String) = s.parseWith(expr)
}

See also:

  • [Full source code for the calculator parser](src/test/kotlin/parser4k/examples/calculator.kt)
  • [Json parser](src/test/kotlin/parser4k/examples/json/json-parser.kt)
  • [Expression language parser](src/test/kotlin/parser4k/examples/expression-lang.kt)

Core concepts

  • Parser is any object which takes Input, attempts to extract some useful data from it and returns Output if successful or null if Parser wasn't able to consume any data. Parsers can be mapped with .map() function similar to how collections are mapped in Kotlin.
  • Input is an immutable object which contains input string and offset, where offset indicates how many characters has been consumed from the string.
  • Output is an immutable object which contains payload (i.e. useful data extracted from input) and Input with shifted offset (to be used by the next Parser).
interface Parser<out T> {
    fun parse(input: Input): Output<T>?
}

data class Input(
    val value: String,
    val offset: Int = 0,
    val leftPayload: Any? = null
)

data class Output<out T>(
    val payload: T,
    val nextInput: Input
)

fun <T, R> Parser<T>.map(transform: (T) -> R) = object : Parser<R> {
    override fun parse(input: Input): Output<R>? {
        val (payload, nextInput) = [email protected](input) ?: return null
        return Output(transform(payload), nextInput)
    }
}

Core parsers

  • str() - consumes input from the current offset if it's equal to the specified string
  • repeat(), zeroOrMore(), oneOrMore(), optional() - applies specified parser multiple times
  • inOrder() - apply each of the specified parsers sequentially ("sequence" could be a good name but it's already used in Kotlin)
  • ref { parser }, ::parser.ref() - workaround to forward-reference fields in Kotlin
  • oneOf() - apply the first matching parser
  • oneOfLongest() - apply the parser which can consume the longest part of the input
  • oneOfWithPrecedence() - apply the first matching parser with arguments indices as operator precedence, where high precedence means that parser will attempt to consume input at the "innermost" level and will produce payload before parsers with lower precedence.
  • .nestedPrecedence() - can be used in combination with oneOfWithPrecedence() on parsers which have nested precedence (e.g. parenthesis)
  • .mapLeftAssoc() - can be used with inOrder() to produce left-associative payload
  • .with(outputCache) - cache parsers output (should be used with any non-toy parser combinator to avoid exponential time complexity)
  • .reset(outputCache) - should be used on the main parser to reset output cache after processing each input
  • .with("parserId", parsingLog) - wrap parser with a logger (can be useful for debugging/understanding what parser is doing, e.g. see [log-tests.kt](src/test/kotlin/parser4k/log-tests.kt)

Common parsers

  • .joinedWith() - wrap parser so that it matches multiple times with specified separator (e.g. a list of arguments to a function separated by commas)
  • inOrder(...).skipFirst(), .skipLast(), .skipWrapper() - skip first, last or both first and last items from payload
  • .asBinary() - makes function with 2 arguments usable as a binary operator, e.g. inOrder(ref { expr }, token("+"), ref { expr }).mapLeftAssoc(::Plus.asBinary())
  • commonparsers.Tokens - parsers for basic bits of input (usually called "tokens"), e.g. whitespace or number

Installation

In Gradle, install the BOM and then any other modules in the dependency block:

implementation platform("dev.forkhandles:forkhandles-bom:X.Y.Z")
implementation "dev.forkhandles:parser4k"

See also