Experiment Name in Reporter
PETsARD adopts a standardized naming convention for identifying and tracking experiments. This document explains the two main naming formats and their usage.
Experiment Name Formats
Experiment Tuple
full_expt_tuple
is a tuple consisting of a module name and an experiment name:
(module_name, experiment_name)
This format is primarily used when creating reports:
# Basic usage
reporter.create({
('Synthesizer', 'exp1'): df_synthetic,
('Evaluator', 'eval1_[global]'): df_results
})
# Multiple experiment comparison
reporter.create({
('Synthesizer', 'exp1_epsilon0.1'): df_low_privacy,
('Synthesizer', 'exp2_epsilon1.0'): df_high_privacy,
('Synthesizer', 'exp3_epsilon5.0'): df_baseline
})
# Different evaluation levels
reporter.create({
('Evaluator', 'eval1_[global]'): global_results,
('Evaluator', 'eval1_[columnwise]'): column_results,
('Evaluator', 'eval1_[pairwise]'): pair_results
})
Experiment String
full_expt_name
is a string that concatenates the module name and experiment name with a hyphen:
{module_name}-{experiment_name}
This format is used for output filenames:
# Synthetic data files
petsard_Synthesizer-exp1.csv
petsard_Synthesizer-exp2_epsilon1.0.csv
# Evaluation report files
petsard[Report]_Evaluator-eval1_[global].csv
petsard[Report]_Evaluator-eval1_[columnwise].csv
Naming Examples
Data Synthesis Experiments
# Comparing different methods
reporter.create({
('Synthesizer', 'exp1_ctgan'): ctgan_results,
('Synthesizer', 'exp2_tvae'): tvae_results,
('Synthesizer', 'exp3_copula'): copula_results
})
# Output files:
# petsard_Synthesizer-exp1_ctgan.csv
# petsard_Synthesizer-exp2_tvae.csv
# petsard_Synthesizer-exp3_copula.csv
Privacy Parameter Experiments
# Privacy parameter adjustment
reporter.create({
('Synthesizer', 'exp1_eps0.1_delta1e-5'): low_privacy_df,
('Synthesizer', 'exp2_eps1.0_delta1e-5'): med_privacy_df,
('Synthesizer', 'exp3_eps10.0_delta1e-5'): high_privacy_df
})
# Output files:
# petsard_Synthesizer-exp1_eps0.1_delta1e-5.csv
# petsard_Synthesizer-exp2_eps1.0_delta1e-5.csv
# petsard_Synthesizer-exp3_eps10.0_delta1e-5.csv
Evaluation Experiments
# Multi-level evaluation
reporter.create({
('Evaluator', 'privacy_risk_[global]'): global_privacy,
('Evaluator', 'data_quality_[columnwise]'): column_quality,
('Evaluator', 'correlation_[pairwise]'): pair_correlation
})
# Output files:
# petsard[Report]_Evaluator-privacy_risk_[global].csv
# petsard[Report]_Evaluator-data_quality_[columnwise].csv
# petsard[Report]_Evaluator-correlation_[pairwise].csv
Naming Guidelines
Module Names
- Use standard module names: ‘Synthesizer’, ‘Evaluator’, ‘Processor’, etc.
- Case sensitivity must match exactly
Experiment Names
- Use meaningful prefixes: ’exp’, ’eval’, ’test’, etc.
- Separate different parts with underscores: method names, parameter settings, etc.
- Use square brackets for evaluation levels: [global], [columnwise], [pairwise]
Parameter Encoding
- Use abbreviations for parameter names: eps (epsilon), del (delta), etc.
- Use concise representation for values: 1e-5, 0.1, etc.
- Connect multiple parameters with underscores: eps0.1_del1e-5