getting_started.rst

/sps/km3net/users/kmcprod/JTE_NEMOWATER/withMX/muon-CC/3-100GeV/JTE.KM3Sim.gseagen.muon-CC.3-100GeV-9.1E7-1bin-3.0gspec.ORCA115_9m_2016.99.root
/sps/km3net/users/mmoser/setenvAA_jpp9_cent_os7.sh
~$: tohdf5 -o testfile.h5 /sps/km3net/users/kmcprod/JTE_NEMOWATER/withMX/muon-CC/3-100GeV/JTE.KM3Sim.gseagen.muon-CC.3-100GeV-9.1E7-1bin-3.0gspec.ORCA115_9m_2016.99.root
++ tohdf5: Converting '/sps/km3net/users/kmcprod/JTE_NEMOWATER/withMX/muon-CC/3-100GeV/JTE.KM3Sim.gseagen.muon-CC.3-100GeV-9.1E7-1bin-3.0gspec.ORCA115_9m_2016.99.root'...
Pipeline and module initialisation took 0.002s (CPU 0.000s).
loading root....  /afs/.in2p3.fr/system/amd64_sl7/usr/local/root/v5.34.23/
loading aalib...  /pbs/throng/km3net/src/Jpp/v9.0.8454//externals/aanet//libaa.so
++ km3pipe.io.aanet.AanetPump: Reading metadata using 'JPrintMeta'
WARNING ++ km3pipe.io.aanet.MetaParser: Empty metadata
WARNING ++ km3pipe.io.aanet.AanetPump: No metadata found, this means no data provenance!
--------------------------[ Blob     250 ]---------------------------
--------------------------[ Blob     500 ]---------------------------
--------------------------[ Blob     750 ]---------------------------
--------------------------[ Blob    1000 ]---------------------------
--------------------------[ Blob    1250 ]---------------------------
--------------------------[ Blob    1500 ]---------------------------
--------------------------[ Blob    1750 ]---------------------------
--------------------------[ Blob    2000 ]---------------------------
--------------------------[ Blob    2250 ]---------------------------
--------------------------[ Blob    2500 ]---------------------------
--------------------------[ Blob    2750 ]---------------------------
--------------------------[ Blob    3000 ]---------------------------
--------------------------[ Blob    3250 ]---------------------------
EventFile io / wall time = 6.27259 / 73.9881 (8.47784 % spent on io.)
================================[ . ]================================
++ km3pipe.io.hdf5.HDF5Sink: HDF5 file written to: testfile.h5
============================================================
3457 cycles drained in 75.842898s (CPU 70.390000s). Memory peak: 177.71 MB
  wall  mean: 0.021790s  medi: 0.019272s  min: 0.015304s  max: 2.823921s  std: 0.049242s
  CPU   mean: 0.020330s  medi: 0.020000s  min: 0.010000s  max: 1.030000s  std: 0.018179s
++ tohdf5: File '/sps/km3net/users/kmcprod/JTE_NEMOWATER/withMX/muon-CC/3-100GeV/JTE.KM3Sim.gseagen.muon-CC.3-100GeV-9.1E7-1bin-3.0gspec.ORCA115_9m_2016.99.root' was converted.
~$: tohdf5 -h
Convert ROOT and EVT files to HDF5.

Usage:
    tohdf5 [options] FILE...
    tohdf5 (-h | --help)
    tohdf5 --version

Options:
    -h --help                       Show this screen.
    --verbose                       Print more output.
    --debug                         Print everything.
    -n EVENTS                       Number of events/runs.
    -o OUTFILE                      Output file (only if one file is converted).
    -j --jppy                       (Jpp): Use jppy (not aanet) for Jpp readout.
    --ignore-hits                   Don't read the hits.
    -e --expected-rows NROWS        Approximate number of events.  Providing a
                                    rough estimate for this (100, 1000000, ...)
                                    will greatly improve reading/writing speed
                                    and memory usage.
                                    Strongly recommended if the table/array
                                    size is >= 100 MB. [default: 10000]
    -t --conv-times-to-jte          Converts all MC times in the file to JTE
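When many files have to be converted, the options listed in the help text can be combined from a script. The following is a minimal Python sketch that only assembles the argument list for a single-file `tohdf5` call from flags shown above (`-o`, `-n`, `-e`). The helper name `build_tohdf5_command` and the example file names are hypothetical, and nothing is executed here.

```python
from typing import List, Optional


def build_tohdf5_command(
    infile: str,
    outfile: Optional[str] = None,
    n_events: Optional[int] = None,
    expected_rows: Optional[int] = None,
) -> List[str]:
    """Assemble the argv list for converting one file with tohdf5.

    Only flags that appear in the `tohdf5 -h` output are used.
    """
    cmd = ["tohdf5"]
    if outfile is not None:
        # -o is only valid when a single input file is converted.
        cmd += ["-o", outfile]
    if n_events is not None:
        cmd += ["-n", str(n_events)]
    if expected_rows is not None:
        # A rough estimate of the number of events, see the help text.
        cmd += ["-e", str(expected_rows)]
    cmd.append(infile)
    return cmd


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(" ".join(build_tohdf5_command("input.root", "testfile.h5", expected_rows=3500)))
```

The returned list can be handed to `subprocess.run(cmd, check=True)` in an environment where `tohdf5` is on the `PATH`.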
~$: ptdump -v testfile.h5
/ (RootGroup) 'KM3NeT'
/event_info (Table(3457,), fletcher32, shuffle, zlib(5)) 'EventInfo'
  description := {
  "weight_w4": Float64Col(shape=(), dflt=0.0, pos=0),
  "weight_w3": Float64Col(shape=(), dflt=0.0, pos=1),
  "weight_w2": Float64Col(shape=(), dflt=0.0, pos=2),
  "weight_w1": Float64Col(shape=(), dflt=0.0, pos=3),
  "run_id": Int64Col(shape=(), dflt=0, pos=4),
  "timestamp": Int64Col(shape=(), dflt=0, pos=5),
  "nanoseconds": Int64Col(shape=(), dflt=0, pos=6),
  "mc_time": Float64Col(shape=(), dflt=0.0, pos=7),
  "event_id": Int64Col(shape=(), dflt=0, pos=8),
  "mc_id": Int64Col(shape=(), dflt=0, pos=9),
  "group_id": Int64Col(shape=(), dflt=0, pos=10)}
...
calibrate /sps/km3net/users/mmoser/det_files/orca_115strings_av23min20mhorizontal_18OMs_alt9mvertical_v1.detx testfile.h5
~$: pip install orcasong
~$: make_nn_images testfile.h5 geofile.detx configfile.toml
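According to the parameter documentation in this guide (`n_bins`, `do2d`, `do3d`, `do4d`), `make_nn_images` turns the hits of each event into binned histograms. As a rough illustration of what such binning means, and not OrcaSong's actual implementation, the sketch below fills a small (x, y, z, t) histogram in pure Python. The function names, the bin ranges and the hit values are all made up for the example.

```python
from collections import Counter
from typing import Iterable, Sequence, Tuple

Hit = Tuple[float, float, float, float]  # (x, y, z, t); hypothetical layout


def bin_index(value: float, low: float, high: float, n_bins: int) -> int:
    """Return the bin index of `value` in [low, high), or -1 if outside."""
    if not (low <= value < high):
        return -1
    idx = int((value - low) / (high - low) * n_bins)
    # Guard against floating-point rounding at the upper edge.
    return min(idx, n_bins - 1)


def bin_hits(
    hits: Iterable[Hit],
    ranges: Sequence[Tuple[float, float]],
    n_bins: Sequence[int],
) -> Counter:
    """Count hits per (x, y, z, t) bin; hits outside any range are dropped."""
    hist: Counter = Counter()
    for hit in hits:
        idx = tuple(
            bin_index(v, lo, hi, n) for v, (lo, hi), n in zip(hit, ranges, n_bins)
        )
        if -1 not in idx:
            hist[idx] += 1
    return hist


if __name__ == "__main__":
    # Toy values only; real detector ranges and bin counts differ.
    ranges = [(0.0, 10.0), (0.0, 10.0), (0.0, 10.0), (0.0, 100.0)]
    n_bins = (2, 2, 2, 4)
    hits = [(1, 1, 1, 10), (6, 1, 9, 60), (1, 2, 3, 20), (50, 1, 1, 10)]
    for idx, count in sorted(bin_hits(hits, ranges, n_bins).items()):
        print(idx, count)
```

Summing such a histogram over one or two axes gives the lower-dimensional projections, which is one way to picture the difference between 2D, 3D and 4D images.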
--- Documentation for every config parameter that is available ---

Arguments whose value is None should be written as the string 'None'.

Parameters
----------
output_dirpath : str
    Full path to the directory where the OrcaSong output should be stored.
chunksize : int
    Chunksize (along axis_0) that is used for saving the OrcaSong output to a .h5 file.
complib : str
    Compression library that is used for saving the OrcaSong output to a .h5 file.
    All PyTables compression filters are available, e.g. 'zlib', 'lzf', 'blosc', ... .
complevel : int
    Compression level for the compression filter that is used for saving the OrcaSong output to a .h5 file.
n_bins : tuple of int
    Declares the number of bins that should be used for each dimension, e.g. (x,y,z,t).
    The option should be written as string, e.g. '11,13,18,60'.
det_geo : str
    Declares what detector geometry should be used for the binning. E.g. 'Orca_115l_23m_h_9m_v'.
do2d : bool
    Declares if 2D histograms, 'images', should be created.
do2d_plots : bool
    Declares if PDF visualizations of the 2D histograms should be created. Cannot be used if do2d=False.
do2d_plots_n : int
    Number of events after which the event loop is stopped (making the 2D plots in do2d_plots takes a long time).
do3d : bool
    Declares if 3D histograms should be created.
do4d : bool
    Declares if 4D histograms should be created.
do4d_mode : str
    If do4d is True, declares what should be used as the 4th dimension after xyz.
    Currently, only 'time' and 'channel_id' are available.
prod_ident : int
    Optional integer identifier for the MC production that is used.
    This is useful, for example, if you use events from two different MC productions, e.g. the 1-5GeV & 3-100GeV Orca 2016 MC.
    In this case, the events are not fully distinguishable by the run_id and the event_id alone!
    To keep them separate, an integer can be set in the event_track for all events, so that they stay distinguishable.
timecut_mode : str
    Defines what timecut should be used in hits_to_histograms.py.
    Currently available:
    'timeslice_relative': Cuts out the central 30% of the snapshot. The value of timecut_timespan doesn't matter in this case.
    'trigger_cluster': Cuts based on the mean of the triggered hits.
    'None': No timecut. The value of timecut_timespan doesn't matter in this case.
timecut_timespan : str/None
    Defines what timespan should be used if a timecut is applied. Only relevant for timecut_mode = 'trigger_cluster'.
    Currently available:
    'all': [-350ns, 850ns] -> 20ns / bin (if e.g. 60 timebins)
    'tight-0': [-450ns, 500ns] -> 15.8ns / bin (if e.g. 60 timebins)
    'tight-1': [-250ns, 500ns] -> 12.5ns / bin (if e.g. 60 timebins)
    'tight-2': [-150ns, 200ns] -> 5.8ns / bin (if e.g. 60 timebins)
do_mc_hits : bool
    Declares if hits (False, i.e. mc_hits + BG) or only mc_hits (True) should be processed.
data_cut_triggered : bool
    Cuts away hits that haven't been triggered.
data_cut_e_low : float
    Cuts away events that have an energy lower than data_cut_e_low.
data_cut_e_high : float
    Cuts away events that have an energy higher than data_cut_e_high.
data_cut_throw_away : float
    Cuts away random events with a certain probability (1: 100%, 0: 0%).
flush_freq : int
    Number of events after which the accumulated output is flushed to the hard disk.
    A larger value makes OrcaSong run faster, but it also increases the RAM usage.

--- Documentation for every config parameter that is available ---
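The bin widths quoted for `timecut_timespan` follow from dividing the length of the time window by the number of time bins. A small Python check of that arithmetic is sketched here, assuming `n_bins` is given as the string '11,13,18,60' with time as the last dimension. The helper names are hypothetical, and this is not OrcaSong's own parsing code.

```python
from typing import Dict, Tuple

# Time windows in ns, copied from the timecut_timespan documentation.
TIMESPANS_NS: Dict[str, Tuple[float, float]] = {
    "all": (-350.0, 850.0),
    "tight-0": (-450.0, 500.0),
    "tight-1": (-250.0, 500.0),
    "tight-2": (-150.0, 200.0),
}


def parse_n_bins(option: str) -> Tuple[int, ...]:
    """Turn a string such as '11,13,18,60' into a tuple of ints."""
    return tuple(int(part) for part in option.split(","))


def time_bin_width_ns(timespan: str, n_time_bins: int) -> float:
    """Width of one time bin in ns for the given timespan name."""
    low, high = TIMESPANS_NS[timespan]
    return (high - low) / n_time_bins


if __name__ == "__main__":
    n_bins = parse_n_bins("11,13,18,60")  # assumed order: x, y, z, t
    for name in TIMESPANS_NS:
        width = time_bin_width_ns(name, n_bins[-1])
        print(f"{name}: {width:.1f} ns / bin")
```

With 60 time bins this reproduces the documented values, e.g. (850 - (-350)) / 60 = 20 ns per bin for 'all'.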