Each example database provided with Vertica includes a sample data generator program that produces output files whose names correspond to the tables in the logical schema. Each data generator has a similar set of input parameters that allow you to specify the number of rows of data to generate for any subset of the tables. To see a detailed list of the parameters for any example database, examine the README file in the example database directory.
Syntax
./example_gen [ --files files ]
[ --seed seed ]
[ --time_file pathname ]
[ --fact_table_name rows ]
[ --dimension_table_name rows ] ...
Semantics
Parameter | Description |
example | is one of the following: clickstream, credithistory, retail, stock, telecom |
--files files | splits the fact table data into the specified number of files. By default, the data generator produces a single, unnumbered fact table data file. If you specify a value of two (2) or more, the data generator numbers the files by appending an underscore character (_) and three digits to the file name, starting at _001. For example, ./retail_gen --files 3 produces Retail_Sales_Fact_001.tbl, Retail_Sales_Fact_002.tbl, and Retail_Sales_Fact_003.tbl. Default: 1 |
--seed seed | the seed for the pseudo-random number generator. If you use the same seed each time you run the data generator, you get the same data files (excluding external factors). Default: 20177 |
--time_file pathname | the pathname of the pre-computed time data input file used to generate the Date_Dimension table. Default: ./Time.txt (supplied by Vertica; contains data for the years 2000-2004). |
--fact_table_name rows | the name of the fact table in example, followed by the number of rows of data to generate for the fact table. Default: 5,000,000 (five million) |
--dimension_table_name rows | the name of a dimension table in example (other than the Date_Dimension table), followed by the number of rows of data to generate for that dimension table. |
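The file numbering rule described for --files can be sketched as a small shell function. This is only an illustration of the naming scheme, not part of the Vertica distribution: the function name expected_fact_files is hypothetical, and the unnumbered name (base name plus .tbl) is an assumption extrapolated from the numbered Retail_Sales_Fact example.

```shell
#!/bin/sh
# Sketch: print the fact table file names expected for a given --files value.
# expected_fact_files is a hypothetical helper, not a Vertica tool.
# Assumption: the unnumbered default file is named <base>.tbl.
expected_fact_files() {
    base=$1          # fact table file base name, e.g. Retail_Sales_Fact
    count=${2:-1}    # value passed to --files (documented default: 1)

    if [ "$count" -lt 2 ]; then
        # One file: no numeric suffix is appended.
        printf '%s.tbl\n' "$base"
        return 0
    fi

    # Two or more files: append _NNN (three digits), starting at _001.
    i=1
    while [ "$i" -le "$count" ]; do
        printf '%s_%03d.tbl\n' "$base" "$i"
        i=$((i + 1))
    done
}

# Names expected after running: ./retail_gen --files 3
expected_fact_files Retail_Sales_Fact 3
```

A list like this can be useful for checking that every expected file is present before loading the data.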
Notes
The contents of the Date_Dimension tables are determined by the time data input file supplied with the example database.
Examples
./retail_gen
./retail_gen --files 3
/home/dbadmin/Retail_Schema/retail_gen \
--time_file /home/dbadmin/Retail_Schema/Time.txt \
--retail_sales_fact 100000 \
--product_dimension 500 \
--store_dimension 50 \
--promotion_dimension 100
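The statement that a fixed seed reproduces the same data files can be checked with a short script. The following is a minimal sketch under stated assumptions: same_seed_same_output is a hypothetical helper, and it assumes the generator writes its output files into the current working directory. Because the helper changes directory before each run, pass the generator and the time file as absolute paths.

```shell
#!/bin/sh
# Sketch: run a data generator twice with the same --seed and compare results.
# same_seed_same_output is a hypothetical helper, not a Vertica tool.
# Assumption: the generator writes its output files to the current directory.
# Usage: same_seed_same_output /abs/path/to/generator SEED [other options...]
# Returns 0 if both runs match, 1 if they differ, 2 if a run failed.
same_seed_same_output() {
    gen=$1
    seed=$2
    shift 2

    run_a=$(mktemp -d) || return 2
    run_b=$(mktemp -d) || { rm -rf "$run_a"; return 2; }
    status=0

    # Each run happens in its own scratch directory.
    (cd "$run_a" && "$gen" --seed "$seed" "$@") || status=2
    if [ "$status" -eq 0 ]; then
        (cd "$run_b" && "$gen" --seed "$seed" "$@") || status=2
    fi

    # Compare the two directories file by file.
    if [ "$status" -eq 0 ] && ! diff -r "$run_a" "$run_b" >/dev/null; then
        status=1
    fi

    rm -rf "$run_a" "$run_b"
    return "$status"
}

# Example call (paths taken from the retail example above):
# same_seed_same_output /home/dbadmin/Retail_Schema/retail_gen 20177 \
#     --time_file /home/dbadmin/Retail_Schema/Time.txt \
#     --retail_sales_fact 1000
```

A small row count keeps the check fast, since the default fact table size is five million rows.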