MacSyFinder provides different types of outputs. At each run, MacSyFinder creates a new folder, whose name is based on a fixed prefix and a random suffix, for instance “macsyfinder-20130128_08-57-46”. MacSyFinder outputs are stored in this run-specific folder.
Raw Hmmer outputs are provided, as long with processed tabular outputs that includes hits filtered as specified by the user. For instance, the Hmmer search for SctC homologs with the corresponding profile will produce as a result two files: “sctC.search_hmm.out” and “sctC.res_hmm_extract”.
The processed output “sctC.res_hmm_extract” recalls on the first lines the parameters used for hits filtering and relevant information on the matches, as for instance:
# gene: sctC extract from /Users/bob/macsyfinder_results/
macsyfinder-20130128_08-57-46/sctC.search_hmm.out hmm output
# profile length= 544
# i_evalue threshold= 0.001000
# coverage threshold= 0.500000
# hit_id replicon_name position_hit hit_sequence_length gene_name gene_system i_eval score
profile_coverage sequence_coverage begin end
PSAE001c01_006940 PSAE001c01 3450 803 sctC T3SS 1.1e-41 141.6
0.588235 0.419676 395 731
PSAE001c01_018920 PSAE001c01 4634 776 sctC T3SS 9.2e-48 161.7
0.976103 0.724227 35 596
PSAE001c01_031420 PSAE001c01 5870 658 sctC T3SS 2.7e-52 176.7
0.963235 0.844985 49 604
PSAE001c01_051090 PSAE001c01 7801 714 sctC T3SS 1.9e-46 157.4
0.571691 0.463585 374 704
Note
Each tabular output file contains a header line describing each column in the output.
Different types of tabular outputs are provided. Headers are provided with the content of the lines in the file.
- macsyfinder.tab - for “ordered” datasets only (db_type is “ordered_replicon” or “gembase”). It provides a summary of the number of each type of systems that have been detected.
- macsyfinder.report - contains all the sequence identifiers of proteins detected as being part of a system, along with statistics on the Hmmer hit, and the status of the component in the system.
- macsyfinder.summary - contains a line of information for each detected system.
For each replicon, a line gives the occurrences of systems that were asked for detection. For example, if the detection was run for the Flagellum and the T6SS, the output will look like:
#Replicon Flagellum_single_locus Flagellum_multi_loci T6SS_single_locus T6SS_multi_loci
escherichia06 1 1 1 0
which means that this “escherichia06” genome harbors 1 flagellum in a single locus, 1 flagellum scattered on multiple loci, and 1 T6SS in a single locus.
Three specific output files are built to store information on the MacSyFinder execution:
- macsyfinder.out - contains information on the procedure during systems detection: clusters found, decisions made for system inference... The same information is also displayed on the standard output.
- macsyfinder.conf - contains the configuration information of the run. It is useful to recover the parameters used for the run.
- macsyfinder.log - the log file, contains raw information on the run. Please send it to us with any bug report.
This file in JSON format is used by MacSyView, for graphical output purpose. It must be loaded through MacSyView to graphically visualize detected systems. For more details, see MacSyView’s description.