@@ -199,24 +199,25 @@ While reading the `.janno` file `trident` trims all leading and trailing whitesp
199199` ` ` default
200200Usage: trident init ((-p|--genoOne FILE) | --genoFile FILE --snpFile FILE
201201 --indFile FILE |
202- --bedFile FILE --bimFile FILE --famFile FILE |
203- --vcfFile FILE) [--snpSet SET] (-o|--outPackagePath DIR)
202+ --bedFile FILE --bimFile FILE --famFile FILE |
203+ --vcfFile FILE) [--snpSet SET] (-o|--outPackagePath DIR)
204204 [-n|--outPackageName STRING] [--minimal]
205205
206206 Create a new Poseidon package from genotype data
207207
208208Available options:
209209 -h,--help Show this help text
210210 -p,--genoOne FILE One of the input genotype data files. Expects .bed,
211- .bed.gz, .bim, .bim.gz or .fam for PLINK, or .geno,
212- .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT. The
213- other files must be in the same directory and must
214- have the same base name. If a gzipped file is given,
215- it is assumed that the file pairs (.geno.gz, .snp.gz)
216- or (.bim.gz, .bed.gz) are both zipped, but not the
217- .fam or .ind file. If a .ind or .fam file is given,
218- it is assumed that none of the file triples is
219- zipped. For VCF please see option --vcfFile
211+ .bed.gz, .bim, .bim.gz or .fam for PLINK, .geno,
212+ .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT,
213+ or.vcf or .vcf.gz for VCF. In case of EIGENSTRAT and
214+ PLINK, the two other files must be in the same
215+ directory and must have the same base name. If a
216+ gzipped file is given, it is assumed that the file
217+ pairs (.geno.gz, .snp.gz) or (.bim.gz, .bed.gz) are
218+ both zipped, but not the .fam or .ind file. If a .ind
219+ or .fam file is given, it is assumed that none of the
220+ file triples is zipped.
220221 --genoFile FILE Eigenstrat genotype matrix, optionally gzipped.
221222 Accepted file endings are .geno, .geno.gz
222223 --snpFile FILE Eigenstrat snp positions file, optionally gzipped.
@@ -358,7 +359,7 @@ Usage: trident forge ((-d|--baseDir DIR) |
358359 --vcfFile FILE) [--snpSet SET])
359360 [--forgeFile FILE | (-f|--forgeString DSL)]
360361 [--selectSnps FILE] [--intersect] [--outFormat FORMAT]
361- [--onlyGeno | --minimal | --preservePyml]
362+ [--onlyGeno | --minimal | --preservePyml] [-z|--zip]
362363 (-o|--outPackagePath DIR) [-n|--outPackageName STRING]
363364 [--packagewise] [--outPlinkPopName MODE] [--ordered]
364365
@@ -369,15 +370,16 @@ Available options:
369370 -h,--help Show this help text
370371 -d,--baseDir DIR A base directory to search for Poseidon packages.
371372 -p,--genoOne FILE One of the input genotype data files. Expects .bed,
372- .bed.gz, .bim, .bim.gz or .fam for PLINK, or .geno,
373- .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT. The
374- other files must be in the same directory and must
375- have the same base name. If a gzipped file is given,
376- it is assumed that the file pairs (.geno.gz, .snp.gz)
377- or (.bim.gz, .bed.gz) are both zipped, but not the
378- .fam or .ind file. If a .ind or .fam file is given,
379- it is assumed that none of the file triples is
380- zipped. For VCF please see option --vcfFile
373+ .bed.gz, .bim, .bim.gz or .fam for PLINK, .geno,
374+ .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT,
375+ or.vcf or .vcf.gz for VCF. In case of EIGENSTRAT and
376+ PLINK, the two other files must be in the same
377+ directory and must have the same base name. If a
378+ gzipped file is given, it is assumed that the file
379+ pairs (.geno.gz, .snp.gz) or (.bim.gz, .bed.gz) are
380+ both zipped, but not the .fam or .ind file. If a .ind
381+ or .fam file is given, it is assumed that none of the
382+ file triples is zipped.
381383 --genoFile FILE Eigenstrat genotype matrix, optionally gzipped.
382384 Accepted file endings are .geno, .geno.gz
383385 --snpFile FILE Eigenstrat snp positions file, optionally gzipped.
@@ -455,6 +457,8 @@ Available options:
455457 CHANGELOG file (if available) are copied to the
456458 output package. Only works for a singular source
457459 package.
460+ -z,--zip Should the resulting genotype- and snp-files be
461+ gzipped?
458462 -o,--outPackagePath DIR Path to the output package directory.
459463 -n,--outPackageName STRING
460464 The output package name. This is optional: If no name
@@ -716,25 +720,26 @@ Usage: trident genoconvert ((-d|--baseDir DIR) |
716720 --snpFile FILE --indFile FILE |
717721 --bedFile FILE --bimFile FILE --famFile FILE |
718722 --vcfFile FILE) [--snpSet SET])
719- --outFormat FORMAT [--onlyGeno ]
720- [-o|--outPackagePath DIR ] [--removeOld ]
721- [--outPlinkPopName MODE] [--onlyLatest ]
723+ --outFormat FORMAT [-o|--outPackagePath DIR ]
724+ [--removeOld] [--outPlinkPopName MODE ] [--onlyLatest ]
725+ [-z|--zip ]
722726
723727 Convert the genotype data in a Poseidon package to a different file format
724728
725729Available options:
726730 -h,--help Show this help text
727731 -d,--baseDir DIR A base directory to search for Poseidon packages.
728732 -p,--genoOne FILE One of the input genotype data files. Expects .bed,
729- .bed.gz, .bim, .bim.gz or .fam for PLINK, or .geno,
730- .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT. The
731- other files must be in the same directory and must
732- have the same base name. If a gzipped file is given,
733- it is assumed that the file pairs (.geno.gz, .snp.gz)
734- or (.bim.gz, .bed.gz) are both zipped, but not the
735- .fam or .ind file. If a .ind or .fam file is given,
736- it is assumed that none of the file triples is
737- zipped. For VCF please see option --vcfFile
733+ .bed.gz, .bim, .bim.gz or .fam for PLINK, .geno,
734+ .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT,
735+ or.vcf or .vcf.gz for VCF. In case of EIGENSTRAT and
736+ PLINK, the two other files must be in the same
737+ directory and must have the same base name. If a
738+ gzipped file is given, it is assumed that the file
739+ pairs (.geno.gz, .snp.gz) or (.bim.gz, .bed.gz) are
740+ both zipped, but not the .fam or .ind file. If a .ind
741+ or .fam file is given, it is assumed that none of the
742+ file triples is zipped.
738743 --genoFile FILE Eigenstrat genotype matrix, optionally gzipped.
739744 Accepted file endings are .geno, .geno.gz
740745 --snpFile FILE Eigenstrat snp positions file, optionally gzipped.
@@ -756,12 +761,13 @@ Available options:
756761 (default: Other)
757762 --outFormat FORMAT the format of the output genotype data: EIGENSTRAT or
758763 PLINK.
759- --onlyGeno Should only the resulting genotype data be returned?
760- This means the output will not be a Poseidon package.
761- -o,--outPackagePath DIR Path to the output package directory. This is
762- optional: If no path is provided, then the output is
763- written to the directories where the input genotype
764- data file (.bed/.geno) is stored. (default: Nothing)
764+ -o,--outPackagePath DIR Path for the converted genotype files to be written
765+ to. If a path is provided, only the converted
766+ genotype files are written out, with no change of the
767+ original package. If no path is provided, genotype
768+ files will be converted in-place, including a change
769+ in the POSEIDON.yml file to yield an updated valid
770+ package (default: Nothing)
765771 --removeOld Remove the old genotype files when creating the new
766772 ones.
767773 --outPlinkPopName MODE Where to write the population/group name into the FAM
@@ -771,6 +777,8 @@ Available options:
771777 --onlyLatest Consider only the latest versions of packages, or the
772778 groups and individuals within the latest versions of
773779 packages, respectively.
780+ -z,--zip Should the resulting genotype- and snp-files be
781+ gzipped?
774782` ` `
775783
776784</details>
@@ -1104,15 +1112,16 @@ Available options:
11041112 compatible with trident.
11051113 --pyml FILE Path to a POSEIDON.yml file.
11061114 -p,--genoOne FILE One of the input genotype data files. Expects .bed,
1107- .bed.gz, .bim, .bim.gz or .fam for PLINK, or .geno,
1108- .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT. The
1109- other files must be in the same directory and must
1110- have the same base name. If a gzipped file is given,
1111- it is assumed that the file pairs (.geno.gz, .snp.gz)
1112- or (.bim.gz, .bed.gz) are both zipped, but not the
1113- .fam or .ind file. If a .ind or .fam file is given,
1114- it is assumed that none of the file triples is
1115- zipped. For VCF please see option --vcfFile
1115+ .bed.gz, .bim, .bim.gz or .fam for PLINK, .geno,
1116+ .geno.gz, .snp, .snp.gz or .ind for EIGENSTRAT,
1117+ or.vcf or .vcf.gz for VCF. In case of EIGENSTRAT and
1118+ PLINK, the two other files must be in the same
1119+ directory and must have the same base name. If a
1120+ gzipped file is given, it is assumed that the file
1121+ pairs (.geno.gz, .snp.gz) or (.bim.gz, .bed.gz) are
1122+ both zipped, but not the .fam or .ind file. If a .ind
1123+ or .fam file is given, it is assumed that none of the
1124+ file triples is zipped.
11161125 --genoFile FILE Eigenstrat genotype matrix, optionally gzipped.
11171126 Accepted file endings are .geno, .geno.gz
11181127 --snpFile FILE Eigenstrat snp positions file, optionally gzipped.
0 commit comments