# This file is read (up to "end") by all lq-text programs. common CommonWords docpath /usr/spool/news:/home/ss4000c/lee/text: minwordlength 3 maxwordlength 18 wordlist on end # each line in this file is either blank, a comment line starting with #, # or contains a keyword followed by an option value. The keyword "end" # takes no options, and the remainer of the file is not read. # # the keywords must appear exactly as shown, or in all lower case. # # keyword arguments can appear in quotes; you can use \" to escape a quote, # and \\ to escape a \ character. # # Common common-file # --- giving the name of a file of common words # (you can use "stoplist" instead of "common"; it is the same) # Docpath "path" (the quotes are optional) # --- giving a list of places to look for files, separated by ":" # Useful tip: avoid putting :: or "." in DOCPATH, as you'll # then get files that might or might not be found, depending # on where you happen to be. $HOME is NOT understood in here. # # Docpath can be replaced by the environment variable $DOCPATH. # # MinWordLen _n_ # --- words shorter than this (before plurals are removed) are # not indxed. # MaxWordLen _n_ # --- words are truncated to this length if they were longer. # # WordList on|off # --- if on, words are stored in two places. This increases the size # of the database slightly, but it lets you use "lqwordlist" to # search the database vocabulary, e.g. # lqwordlist -g 'ly$' # finds all words ending in "ly" (-g allows egrep patterns) # and lqquery 'boy*' will also work. You can save between 5% # to 10% of the database size by turning this off in some cases. # # echo hello # this will make any lq-text program print "hello" to stderr as it reads # the config file; it's probably not very useful. # # FileBlockSize N # --- this is the number of bytes in a file block; the default is # approx. 128 * minwordlength. The value must be the same for # the lifetime of the database. Increasing this will slightly # improve phrase matching, but may slightly increase the # database size. # # IndexNumbers On|Off # --- if on, numbers (sequences of digits) are indexed as if # they were words. Default is off. # # ConvertNumbers on|Off # --- if on, octal numbers (with a leading 0) are converted to decimal, # as are hex numbers with a leading 0x. # # # IgnoreHTMLhead [on|off # --- this is really an ugly hack. If On, nothing is indexed in an # HTML file (ending in .htm or .html) until a tag is reached # that starts with a "b" or "B", e.g. . # # locale string # --- sets locale (by calling setlocale()) to the given value. This # will affect lq-text's idea of what constitutes a word. # # wordflags flags|separated|thus # # --- only these flags will be stored. # You can use the following flags: # Plural # UpperCase # Possessive # LastHadPunct # LastWasCommon # LastHadLetters # HasStuffBefore # NextHasPunct # NextIsCommon # All # Example: # wordflags Plural|LastWasCommon # You can also subtract flags: # wordflags All-Plural|UpperCase # will set all flags except those two. The default is All. # # Title "string" -- the title of the database (unused) # administrator_email -- unused # data_owner_email -- unused # url -- unused #