Lab Resources Bulk Load

  • add something here

Fields

AttachScores

This adds new scores to the ORFs.

  1. Make the Lab Bulk Load window active in the Board.
  2. On the Bulk Load menu click Attach Scores.

Note

  • The currently attached scores will be replaced with the new ones.
  • TODO: file format and an example

Attach DB References

This adds DB references to ORFs.

  1. Make the Lab Bulk Load window active in the Board.
  2. On the Bulk Load menu click Attach Scores.

Note

  • A Sesame Administrator needs to be contacted to remove or correct an existing DB reference.
  • TODO: file format and an example

Load ORFs From CSV File

This adds ORFs to Sesame.

  1. Make the Lab Bulk Load window active in the Board.
  2. On the Bulk Load menu click Attach Scores.
  3. In the folder list, locate and open the folder that contains the file.
  4. Click the file, and then click Load CSV File.

Note

  • The CSV file consists of the header row followed by rows that contain comma separated data.
  • Example File
  • The header, and the column order must be exactly as in the Example File.
  • Commas in the field values are allowed only in quoted fields, i.e. "this,that"
  • There should be no empty rows between the header and the data rows, nor among the data rows.
  • Column headers and definitions
    src_no The Src No Field from the System Resource Source Oraganism or Lab Resource Source Organism. If a particular Source Organism is not in the resource table, contact a Sesame Administrator to add it first.
    lab_id Lab specific Id.
    Up to 16 characters.
    chromosome Chromosome.
    Up to 8 characters.
    main_id This is the 'Main Id', but not the unique Sesame Id. The Unique Sesame Id is autogenerated during the upload.
    Up to 20 characters.
    gene_locus Gene Locus.
    Up to 16 characters.
    gene_locator Gene Locator.
    Up to 8 characters.
    prot_name Protein Name.
    Up to 300 characters.
    prot_category Protein Category.
    Choices:
    H - Hypothetical
    K - Known
    P - Putative
    U - Unknown
    1 character.
    pred_strclass Predicted Structure Class.
    Choices:
    A - All-Alpha
    B - All-Beta
    C - Alpha-Beta
    I - Irregular
    U - Unclassified
    D - Membrane All-Alpha
    E - Membrane All-Beta
    F - Membrane Alpha-Beta
    G - Membrane Irregular
    M - Membrane Unclassified
    1 character.
    struct_known Structure Known.
    Choices:
    Y - Yes
    Empty field if the structure is not known.
    Up to 1 character.
    paralogue_id Paralogue Id.
    Up to 100 characters.
    residuelength Amino Acid Sequence Length.
    The known value, or -1.
    It will be recalculated during the upload.
    gcodelength Nucleotide Sequence Length.
    The known value, or -1.
    It will be recalculated during the upload.
    aacode Amino Acid Sequence.
    If there is a gcode, it will be recalculated from it during the upload.
    Up to 8000 characters.
    nofintrons Number of Introns.
    The known value, or -1 if not known.
    sigpeplength Signal Peptide Length.
    The known value, or -1 if not known.
    molweight Molecular Weight. The known value, or -1.
    It will be recalculated during the upload.
    e280nm Molar Absorption Coefficient, or Molar Extinction Coefficient.
    The known value, or -1.
    It will be recalculated during the upload.
    pi Isoelectric Point.
    The known value, or -1.
    It will be recalculated during the upload.
    parent_no Parent Number.
    The unique Sesame ORF No or Database No of the parent ORF already in Sesame.
    0 if there is no parent.
    orf_type ORF Type.
    Choices:
    0 - Original ORF
    1 - Chunk
    2 - Splice
    3 - Mutant
    4 - Extension
    5 - RNA
    6 - DNA
    7 - Amino Acid Sequence Only
    sourcedb_name Source Database Name.
    Up to 20 characters.
    sourcedb_version Source Database Version.
    Up to 20 characters.
    sourcedb_date Source Database Date.
    Date in format yyyy-mm-dd, or empty field.
    If empty field, the current date will be inserted durint the upload.
    noftd Number of Predicted Transmembrane Segments.
    -1 if not known.
    closesthomologue Closest Homologue.
    Up to 400 characters.
    coiledcoil Coiled Coil.
    Percentage, a decimal number between 0.0 and 100.0
    -1 if not known.
    gccontent G + C Content.
    Percentage, a decimal number between 0.0 and 100.0
    -1 if not known.
    lowcomplexity Low Complexity.
    Percentage, a decimal number between 0.0 and 100.0
    -1 if not known.
    position Position.
    Describes the start end end of the segmentsin the genome.
    Up to 2000 characters.
    manuallyedited Manually Edited.
    Choices:
    Y - Yes
    N - No
    1 character.
    gcode Nucleotide Sequence.
    Up to 24000 characters.
    pref Nucleotide Sequence Prefix.
    Up to 4000 characters.
    suf Nucleotide Sequence Suffix.
    Up to 4000 characters.
    targetscore Target Score.
    Up to 30 characters.
    pred_cleavsite Predicted Cleavage Site.
    Up to 30 characters.
    cluster_seed Cluster Seed.
    Up to 100 characters.
    pfamdomains PFAM Domains.
    Empty field if not known.
    If there is more than one, separate them by commas, and doublequote.
    Example for a field consisting of 2 PFAM strings:
    "PF02170 PAZ domain 8.4e-50,PF00270 DEAD/DEAH box helicase 2.3e-06"
    Each and every PFAM string can be up to 300 characters.
    dbreferences External Database reference strings.
    Empty field if none.
    If there is more than one, separate them by commas, and doublequote.
    Every dbreference must be of the form:
    db_name:db_id
    Predefined db_name strings:
    BMRB, CAS, CCRIS, CHEBI, EINECS, FEMA, GenBank, HSDB, KEGG, MAtDB, MMCD, NSC, OMIM, PDB, PIR, PUBCHEM CID, PUBCHEM SID, SWISS-PROT, TIGR, TrEMBL
    Example for a field consisting of 3 external database references:
    "GenBank:xxx,OMIM:nnn,SWISS-PROT:zzz"
    Each and every db_name is up to 16 characters.
    Each and every db_id is up to 24 characters.
  • The Lab Bulk Load window will show the line number of the lines that have been loaded. Note, that Line 1 is the header line.

Load SDF Data

This...

  1. Make the Lab Bulk Load window active in the Board.
  2. On the Bulk Load menu click Attach Scores.

Note

CrystalFarm

This...

  1. Make the Lab Bulk Load window active in the Board.
  2. On the Bulk Load menu click Attach Scores.

Note

General Comments

  • To open the Lab Bulk Load window in the Board, on the Options menu select Lab Resources then click Bulk Load.

See also

Copyright © 1999-2008 Zsolt Zolnai, and the University of Wisconsin - Madison