Class AnalysisResponseBase

    • Constructor Detail

      • AnalysisResponseBase

        public AnalysisResponseBase()
    • Method Detail

      • buildPhases

        protected List<AnalysisResponseBase.AnalysisPhase> buildPhases​(NamedList<Object> phaseNL)
        Parses the given named list and builds a list of analysis phases form it. Expects a named list of the form:
        
          <lst name="index">
              <arr name="Tokenizer">
                  <str name="text">the_text</str>
                  <str name="rawText">the_raw_text</str> (optional)
                  <str name="type">the_type</str>
                  <int name="start">1</str>
                  <int name="end">3</str>
                  <int name="position">1</str>
                  <bool name="match">true | false</bool> (optional)
              </arr>
              <arr name="Filter1">
                  <str name="text">the_text</str>
                  <str name="rawText">the_raw_text</str> (optional)
                  <str name="type">the_type</str>
                  <int name="start">1</str>
                  <int name="end">3</str>
                  <int name="position">1</str>
                  <bool name="match">true | false</bool> (optional)
              </arr>
              ...
          </lst>
         
        The special case is a CharacterFilter that just returns a string, which we then map to a single token without type.
        Parameters:
        phaseNL - The names list to parse.
        Returns:
        The built analysis phases list.
      • buildTokenInfoFromString

        protected AnalysisResponseBase.TokenInfo buildTokenInfoFromString​(String value)
        Convert a string value (from CharacterFilter) into a TokenInfo for its value full span.
        Parameters:
        value - String value
        Returns:
        The built token info (with type set to null)
      • buildTokenInfo

        protected AnalysisResponseBase.TokenInfo buildTokenInfo​(NamedList<Object> tokenNL)
        Parses the given named list and builds a token infoform it. Expects a named list of the form:
        
          <arr name="Tokenizer">
              <str name="text">the_text</str>
              <str name="rawText">the_raw_text</str> (optional)
              <str name="type">the_type</str>
              <int name="start">1</str>
              <int name="end">3</str>
              <int name="position">1</str>
              <bool name="match">true | false</bool> (optional)
          </arr>
         
        Parameters:
        tokenNL - The named list to parse.
        Returns:
        The built token info.