Functions
def	_log_unsupported_language (language, supported_languages)

def	extract_datetime (text, anchorDate=None, lang=None, default_time=None)

def	extract_duration (text, lang=None)

def	extract_number (text, short_scale=True, ordinals=False, lang=None)

def	extract_numbers (text, short_scale=True, ordinals=False, lang=None)

def	fuzzy_match (x, against)

def	get_gender (word, context="", lang=None)

def	match_one (query, choices)

def	normalize (text, lang=None, remove_articles=True)

Function Documentation

def mycroft.util.parse._log_unsupported_language	(	language,
		supported_languages
	)

private

Log a warning when a language is unsupported

Arguments:
    language: str
        The language that was supplied.
    supported_languages: [str]
        The list of supported languages.

Definition at line 43 of file parse.py.

def mycroft.util.parse.extract_datetime	(	text,
		anchorDate = `None`,
		lang = `None`,
		default_time = `None`
	)

Extracts date and time information from a sentence.  Parses many of the
common ways that humans express dates and times, including relative dates
like "5 days from today", "tomorrow', and "Tuesday".

Vague terminology are given arbitrary values, like:
    - morning = 8 AM
    - afternoon = 3 PM
    - evening = 7 PM

If a time isn't supplied or implied, the function defaults to 12 AM

Args:
    text (str): the text to be interpreted
    anchorDate (:obj:`datetime`, optional): the date to be used for
        relative dating (for example, what does "tomorrow" mean?).
        Defaults to the current local date/time.
    lang (str): the BCP-47 code for the language to use, None uses default
    default_time (datetime.time): time to use if none was found in
        the input string.

Returns:
    [:obj:`datetime`, :obj:`str`]: 'datetime' is the extracted date
        as a datetime object in the user's local timezone.
        'leftover_string' is the original phrase with all date and time
        related keywords stripped out. See examples for further
        clarification

        Returns 'None' if no date or time related text is found.

Examples:

    >>> extract_datetime(
    ... "What is the weather like the day after tomorrow?",
    ... datetime(2017, 06, 30, 00, 00)
    ... )
    [datetime.datetime(2017, 7, 2, 0, 0), 'what is weather like']

    >>> extract_datetime(
    ... "Set up an appointment 2 weeks from Sunday at 5 pm",
    ... datetime(2016, 02, 19, 00, 00)
    ... )
    [datetime.datetime(2016, 3, 6, 17, 0), 'set up appointment']

    >>> extract_datetime(
    ... "Set up an appointment",
    ... datetime(2016, 02, 19, 00, 00)
    ... )
    None

Definition at line 202 of file parse.py.

def mycroft.util.parse.extract_duration	(	text,
		lang = `None`
	)

Convert an english phrase into a number of seconds

Convert things like:
    "10 minute"
    "2 and a half hours"
    "3 days 8 hours 10 minutes and 49 seconds"
into an int, representing the total number of seconds.

The words used in the duration will be consumed, and
the remainder returned.

As an example, "set a timer for 5 minutes" would return
(300, "set a timer for").

Args:
    text (str): string containing a duration
    lang (str): the BCP-47 code for the language to use, None uses default

Returns:
    (timedelta, str):
                A tuple containing the duration and the remaining text
                not consumed in the parsing. The first value will
                be None if no duration is found. The text returned
                will have whitespace stripped from the ends.

Definition at line 166 of file parse.py.

def mycroft.util.parse.extract_number	(	text,
		short_scale = `True`,
		ordinals = `False`,
		lang = `None`
	)

Takes in a string and extracts a number.

Args:
    text (str): the string to extract a number from
    short_scale (bool): Use "short scale" or "long scale" for large
        numbers -- over a million.  The default is short scale, which
        is now common in most English speaking countries.
        See https://en.wikipedia.org/wiki/Names_of_large_numbers
    ordinals (bool): consider ordinal numbers, e.g. third=3 instead of 1/3
    lang (str): the BCP-47 code for the language to use, None uses default
Returns:
    (int, float or False): The number extracted or False if the input
                           text contains no numbers

Definition at line 126 of file parse.py.

def mycroft.util.parse.extract_numbers	(	text,
		short_scale = `True`,
		ordinals = `False`,
		lang = `None`
	)

    Takes in a string and extracts a list of numbers.

Args:
    text (str): the string to extract a number from
    short_scale (bool): Use "short scale" or "long scale" for large
        numbers -- over a million.  The default is short scale, which
        is now common in most English speaking countries.
        See https://en.wikipedia.org/wiki/Names_of_large_numbers
    ordinals (bool): consider ordinal numbers, e.g. third=3 instead of 1/3
    lang (str): the BCP-47 code for the language to use, None uses default
Returns:
    list: list of extracted numbers as floats, or empty list if none found

Definition at line 97 of file parse.py.

def mycroft.util.parse.fuzzy_match	(	x,
		against
	)

Perform a 'fuzzy' comparison between two strings.
Returns:
    float: match percentage -- 1.0 for perfect match,
           down to 0.0 for no match at all.

Definition at line 59 of file parse.py.

def mycroft.util.parse.get_gender	(	word,
		context = `""`,
		lang = `None`
	)

Guess the gender of a word

Some languages assign genders to specific words.  This method will attempt
to determine the gender, optionally using the provided context sentence.

Args:
    word (str): The word to look up
    context (str, optional): String containing word, for context
    lang (str): the BCP-47 code for the language to use, None uses default

Returns:
    str: The code "m" (male), "f" (female) or "n" (neutral) for the gender,
         or None if unknown/or unused in the given language.

Definition at line 321 of file parse.py.

def mycroft.util.parse.match_one	(	query,
		choices
	)

    Find best match from a list or dictionary given an input

    Arguments:
        query:   string to test
        choices: list or dictionary of choices

    Returns: tuple with best match, score

Definition at line 68 of file parse.py.

def mycroft.util.parse.normalize	(	text,
		lang = `None`,
		remove_articles = `True`
	)

Prepare a string for parsing

This function prepares the given text for parsing by making
numbers consistent, getting rid of contractions, etc.

Args:
    text (str): the string to normalize
    lang (str): the BCP-47 code for the language to use, None uses default
    remove_articles (bool): whether to remove articles (like 'a', or
                            'the'). True by default.

Returns:
    (str): The normalized string.

Definition at line 281 of file parse.py.

Functions

Function Documentation