Bandit Commands Reference

This article is a reference for: http://support.emcien.com/help/article/link/using-bandit

An overview of the command line parameters can be seen by simply calling Bandit with the “-h” parameter (bandit -h), in which you will get something like this:

   -h                help:        Emcien Bandit version 50 (www.emcien.com)
   -d CHAR    dependent:    name of the dependent category
   -f CHAR    form:        form of output file (W for wide or R for receipt)
   -s CHAR    separator:    Separator character
   -t CHAR    table:        name of table file
   -b CHAR    bands:        name of bands file
   -r INT    randomize:    1 for 10pct test; 2 for 20pct test        
   -v INT         verbosity:    Print out lots of extra information on stderr

There are 9 commands you can pass (7 numerical banding and 2 transformational):

  • freq – Similar frequency bands – five bands are chosen to divide frequency of transactions into similar sizes. This command can include the number of bands you wish to have. (i.e. “freq8” would create 8 bands based on frequency)
  • wide – Equal width bands – range of numerical values is divided evenly into five bands. This command can include the number of bands you wish to have. (i.e. “wide7” would create 7 equal width bands)
  • fuzz – Fuzzy bands organizes the data into five bands with sizes of approximately 10%, 20%, 40%, 20%, 10% of the transactions. Starts with the highest value, places that into the middle 40% and builds left and right of that to create the other bands.
  • stat – builds bands based on the standard deviation
  • half – builds bands based on half of the standard deviation
  • info – Information Banding – based on “information theory”, recursively breaks down the data into chunks with the goal to reduce the entropy/disorder (basically trying to cleanly organize bands such that predictability is maximized). This banding type REQUIRES the dependent parameter (“-d”) to be passed in the command line.
  • user – User defined bands allow for custom banding of the numeric ranges and assigning them names. “User” type bands REQUIRE the use of a “bands.csv” input file. See section on User Defined Banding for more information.
  • cat – Categorical Data command tells bandit to leave the data alone and pass it onto the output file as is.
  • skip – Skips the column of data and does not include it in the output file at all.