From AureoWiki
Jump to: navigation, search
Pangenome04-0298108BA0217611819-9771193COLECT-R 2ED133ED98HO 5096 0412JH1JH9JKD6008JKD6159LGA251M013MRSA252MSHR1132MSSA476MW2Mu3Mu50N315NCTC8325NewmanRF122ST398T0131TCH60TW20USA300_FPR3757USA300_TCH1516VC40

Summary[edit source | edit]

  • pan ID?: SAUPAN004550000
  • symbol?: hsdS2
  • description?: specificity determinant HsdS

      descriptions from strain specific annotations:

    • specificity determinant HsdS
    • type I restriction-modification enzyme, S subunit, EcoA family protein
    • type I restriction-modification system subunit S
    • type I restriction modification system, site specificity determination subunit
    • type I restriction-modification enzyme, S subunit
    • pseudogene
    • restriction modification DNA specificity protein
    • restriction modification system DNA specificity subunit
    • type I restriction modification DNA specificity domain protein
    • Type I restriction-modification system, specificity subunit S
    • type I restriction-modification system specificity protein
    • type I site-specific deoxyribonuclease specificity subunit
  • strand?: -
  • coordinates?: 4771132..4780171
  • synteny block?: BlockID0035400
  • occurrence?: in 59% of 32 strains

Orthologs[edit source | edit]

    COL:
    SACOL1861 (hsdS)
    N315:
    NCTC8325:
    Newman:
    NWMN_1699 (hsdS)
    USA300_FPR3757:
    04-02981:
    SA2981_1762
    08BA02176:
    11819-97:
    71193:
    ECT-R 2:
    ECTR2_1640
    ED133:
    ED98:
    SAAV_1815 (hsdS)
    HO 5096 0412:
    JH1:
    SaurJH1_1894
    JH9:
    SaurJH9_1859
    JKD6008:
    JKD6159:
    LGA251:
    M013:
    MRSA252:
    MSHR1132:
    MSSA476:
    SAS1731
    Mu3:
    SAHV_1792
    Mu50:
    SAV1807
    MW2:
    MW1750 (hsdS)
    RF122:
    SAB1667c
    ST398:
    T0131:
    SAT0131_01927
    TCH60:
    TW20:
    SATW20_18040
    USA300_TCH1516:
    USA300HOU_1799 (hsdS2)
    VC40:
    SAVC_08270

Genome Viewer[edit source | edit]

COL chromosome
N315 chromosome
NCTC8325 chromosome
Newman chromosome
USA300_FPR3757 chromosome

Alignments[edit source | edit]

  • alignment of orthologues:
    CLUSTAL format alignment by MAFFT L-INS-i (v7.205)


    COL             MSNTQKKNVPELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIR
    N315            MSNTQTKNVPELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIR
    NCTC8325        MSNTQTKNVPELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIR
    Newman          MSNTQKKNVPELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIR
    USA300_FPR3757  MSNTQKKNVPELRFPGFEGEWEEKKLGNLTTKIGSGKTPKGGSENYTNKGIPFLRSQNIR
                    *****.******************************************************

    COL             NGKLNLNDLVYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCI
    N315            NGKLNLNDLVYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCI
    NCTC8325        NGKLNLNDLVYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCI
    Newman          NGKLNLNDLVYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCI
    USA300_FPR3757  NGKLNLNDLVYISKDIDDEMKNSRTYYGDVLLNITGASIGRTAINSIVETHANLNQHVCI
                    ************************************************************

    COL             IRLKKEYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIG
    N315            IRLKKEYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIG
    NCTC8325        IRLKKEYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIG
    Newman          IRLKKEYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIG
    USA300_FPR3757  IRLKKEYYYNFFGQYLLSRKGKRKIFLAQSGGSREGLNFKEIANLKIFTPTIFEEQQKIG
                    ************************************************************

    COL             KFFSKLDRQIELEEQKLELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIF
    N315            QFFSKLDQQIELEEQKLELLQQQKKCYIQKIFSQELRFKDEEGNYYKGWNKKQLKDVLEF
    NCTC8325        KFFSKLDRQIELEEQKLELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIF
    Newman          KFFSKLDRQIELEEQKLELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIF
    USA300_FPR3757  KFFSKLDRQIELEEQKLELLQQQKKGYMQKIFTQELRFKDENGEEYPEWENKFIKDIFIF
                    :******:***************** *:****:********:*: *  *::* :**:: *

    COL             ENNR-----RKPITSSLREKGLYPYYGATGIIDYVKDYLFNNEERL---------LIGED
    N315            SNKRTINENEYPVLTSSRQ-------GLILQSDYYKDRKTFAESNIGYFILPKNHITYRS
    NCTC8325        ENNR-----RKPITSSLREKGLYPYYGATGIIDYVKDYLFNNEERL---------LIGED
    Newman          ENNR-----RKPITSSLREKGLYPYYGATGIIDYVKDYLFNNEERL---------LIGED
    USA300_FPR3757  ENNR-----RKPITSSLREKGLYPYYGATGIIDYVKDYLFNNEERL---------LIGED
                    .*:*     . *: :* *:       *     ** **     *..:         :  ..

    COL             GAKWGQFETSSFIANGQYWVNNHAHVVKSNDHNLFFMNYYLNF---KELRAFVTGNAPAK
    N315            RSDDGIFKFNLNLMIDVGIISKYYPVFKGIDANQYYLTLHLNYQLKKEYIKYATGTSQLV
    NCTC8325        GAKWGQFETSSFIANGQYWVNNHAHVVKSNDHNLFFMNYYLNF---KELRAFVTGNAPAK
    Newman          GAKWGQFETSSFIANGQYWVNNHAHVVKSNDHNLFFMNYYLNF---KELRAFVTGNAPAK
    USA300_FPR3757  GAKWGQFETSSFIANGQYWVNNHAHVVKSNDHNLFFMNYYLNF---KELRAFVTGNAPAK
                     :. * *: .  :  .   :.::  *.*. * * :::. :**:   **   :.**.:   

    COL             LTHANLCNINLKIPCLTEQDKVSALLKSIDNKMNNQMNRIELLKERKKGLLQKMFI
    N315            LSQKDLQNIKTKLPSYEEQQKIGDFFSEIDRLVEKQSSKVGRLKVRKKELLQKMFV
    NCTC8325        LTHANLCNINLKIPCLTEQDKVSALLKSIDNKMNNQMNRIELLKERKKGLLQKMFI
    Newman          LTHANLCNINLKIPCLTEQDKVSALLKSIDNKMNNQMNRIELLKERKKGLLQKMFI
    USA300_FPR3757  LTHANLCNINLKIPCLTEQDKVSALLKSIDNKMNNQMNRIELLKERKKGLLQKMFI
                    *:: :* **: *:*.  **:*:. ::..**. :::* .::  ** *** ******: