Generating Custom Data Files

Each example database provided with Vertica includes a sample data generator program that produces output files whose names correspond to the tables in the logical schema. The data generators accept a similar set of input parameters, which let you specify the number of rows of data to generate for any subset of the tables. For a detailed list of the parameters for a particular example database, see the README file in that example database's directory.

Syntax

./example_gen [ --files files ]
              [ --seed seed ]
              [ --time_file pathname ]
              [ --fact_table_name rows ]
              [ --dimension_table_name rows ] ...

Semantics

Parameter
    Description

example
    One of the following:
        clickstream
        credithistory
        retail
        stock
        telecom

files files
    Splits the fact table data into the specified number of files. By default, the data generator produces a single, unnumbered fact table data file. If you specify a value of 2 or more, the data generator numbers the files by appending an underscore character (_) and three digits to the file name, starting at _001. For example:

        ./retail_gen --files 3

    produces:

        Retail_Sales_Fact_001.tbl
        Retail_Sales_Fact_002.tbl
        Retail_Sales_Fact_003.tbl

    Default: 1

seed seed
    The seed for the pseudo-random number generator. Running the data generator repeatedly with the same seed produces the same data files (excluding external factors).

    Default: 20177

time_file pathname
    The pathname of the precomputed time data input file used to generate the Date_Dimension table.

    Default: ./Time.txt (supplied by Vertica; contains data for the years 2000-2004)

fact_table_name rows
    The name of the fact table in example, followed by the number of rows of data to generate for that fact table.

    Default: 5,000,000 (five million)

dimension_table_name rows
    The name of a dimension table in example (other than the Date_Dimension table), followed by the number of rows of data to generate for that dimension table.
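
The file numbering rule described for the files parameter can be sketched as a small shell function. This is only an illustration, not something shipped with Vertica: the function name expected_fact_files is hypothetical, and the name of the single unnumbered file (the base name plus .tbl) is an assumption inferred from the numbered names in the retail example.

```shell
# Hypothetical helper (not shipped with Vertica): print the fact table file
# names expected for a given --files value.
#   $1 = base name of the fact table file, e.g. Retail_Sales_Fact
#   $2 = value passed to --files
expected_fact_files() {
    base=$1
    count=$2
    if [ "$count" -le 1 ]; then
        # Assumption: the single default file is the base name plus .tbl.
        printf '%s.tbl\n' "$base"
        return
    fi
    i=1
    while [ "$i" -le "$count" ]; do
        # Underscore plus a three-digit, zero-padded number, starting at 001.
        printf '%s_%03d.tbl\n' "$base" "$i"
        i=$((i + 1))
    done
}

expected_fact_files Retail_Sales_Fact 3
```

A list like this can be handy when a load script needs to confirm that every expected file exists before it starts.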

Notes

Examples

./retail_gen

./retail_gen --files 3

/home/dbadmin/Retail_Schema/retail_gen \
    --time_file /home/dbadmin/Retail_Schema/Time.txt \
    --retail_sales_fact 100000 \
    --product_dimension 500 \
    --store_dimension 50 \
    --promotion_dimension 100
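
One way to check the claim that a fixed seed yields the same data files is to run the generator twice and compare the results. The sketch below is a generic helper, not part of any example database: the name check_reproducible is hypothetical, and it assumes the generator writes its output files to the current working directory, which this page does not state. Because the helper changes directory, the generator and the time file should be given as absolute paths.

```shell
# Hypothetical helper: run the same command in two scratch directories and
# report whether the files it leaves behind are identical.
# Assumption: the command writes its output files to the current directory.
# Returns 0 if the outputs match, 1 if they differ, 2 if a run failed.
check_reproducible() {
    dir_a=$(mktemp -d) || return 2
    dir_b=$(mktemp -d) || return 2
    status=0
    (cd "$dir_a" && "$@" >/dev/null) || status=2
    (cd "$dir_b" && "$@" >/dev/null) || status=2
    if [ "$status" -eq 0 ]; then
        # diff -r exits 0 only when both directory trees match.
        diff -r "$dir_a" "$dir_b" >/dev/null || status=1
    fi
    rm -rf "$dir_a" "$dir_b"
    return "$status"
}

# Example call with the retail generator; the seed value 42 and the row
# count are arbitrary:
# check_reproducible /home/dbadmin/Retail_Schema/retail_gen \
#     --seed 42 \
#     --time_file /home/dbadmin/Retail_Schema/Time.txt \
#     --retail_sales_fact 1000
```

A small row count keeps the two runs quick; a return status of 1 would suggest that one of the external factors mentioned for the seed parameter is affecting the output.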