From AureoWiki

Summary

  • pan ID: SAUPAN004553000
  • symbol:
  • synonym:
  • description: ATP-binding protein

      descriptions from strain-specific annotations:

    • ATP-binding protein
    • NTPase
    • pseudogene
  • strand: -
  • coordinates: 4771706..4777825
  • synteny block: BlockID0035430
  • occurrence: in 30% of 33 strains
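
As a rough illustration of the two numeric fields above, the following Python sketch derives the span length from the coordinate string and estimates how many strains carry the gene. It assumes the coordinates are 1-based and inclusive and that the listed percentage is rounded, so the strain count is only an estimate. The helper names `span_length` and `strains_with_gene` are hypothetical and not part of AureoWiki.

```python
def span_length(coords: str) -> int:
    """Length of a 'start..end' span, assuming 1-based inclusive coordinates."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def strains_with_gene(percent: float, total: int) -> int:
    """Estimate the strain count from a rounded percentage (an approximation)."""
    return round(percent / 100 * total)


print(span_length("4771706..4777825"))  # 6120
print(strains_with_gene(30, 33))        # 10
```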

Suggested pan gene symbol: "avs2". Suggested description: "AVAST bacteriophage defense system type II". Both are proposed to harmonize with PMID 36302390; further validation information is available at DefenseFinder. AVAST systems are STAND (signal transduction ATPase with numerous domains) NTPases that recognize phage large terminase subunit proteins through their C-terminal domain and trigger abortive infection through their N-terminal effector domain (e.g. a nuclease or protease), per PMID 35951700.

Orthologs

    COL:
    N315:
    NCTC8325:
    Newman:
    USA300_FPR3757:
    04-02981:
    08BA02176:
    11819-97:
    6850:
    71193:
    ECT-R 2:
    ED133:
    ED98:
    HO 5096 0412:
    JH1:
    JH9:
    JKD6008:
    JKD6159:
    LGA251:
    M013:
    MRSA252:
    MSHR1132:
    MSSA476: SAS1729
    Mu3:
    Mu50:
    MW2: MW1749
    RF122:
    ST398:
    T0131: SAT0131_01926
    TCH60:
    TW20: SATW20_18000
    USA300_TCH1516: USA300HOU_1798
    VC40: SAVC_08265

Alignments

  • alignment of orthologues:
    CLUSTAL format alignment by MAFFT L-INS-i (v7.307)


    COL             ------------------------------------------------------------
    NCTC8325        MKRSTNQEKFLDTLIRLNTKIEELGKINILNNHIYSEYFFRDLLNIVYGYSLENHNKKQK
    Newman          ------------------------------------------------------------
    USA300_FPR3757  MKRSTNQEKFLDTLIRLNTKIEELGKINILNNHIYSEYFFRDLLNIVYGYSLENHNKKQK
                                                                                

    COL             ---------------------------------------MEEGYRLKFIFIGNQNNNIKN
    NCTC8325        NAPAFDLIDNTNKIIIQVTATCKKQKIEDTLKKEYLTNKMEEGYRLKFIFIGNQNNNIKN
    Newman          ---------------------------------------MEEGYRLKFIFIGNQNNNIKN
    USA300_FPR3757  NAPAFDLIDNTNKIIIQVTATCKKQKIEDTLKKEYLTNKMEEGYRLKFIFIGNQNNNIKN
                                                           *********************

    COL             KNFSNPHNILFDSKKDIILTQDLCEEFLNLNINKQDHAIELLKKELSPLLFEDSLSYLKE
    NCTC8325        KNFSNPHNILFDSKKDIILTQDLCEEFLNLNINKQDHAIELLKKELSPLLFEDSLSYLKE
    Newman          KNFSNPHNILFDSKKDIILTQDLCEEFLNLNINKQDHAIELLKKELSPLLFEDSLSYLKE
    USA300_FPR3757  KNFSNPHNILFDSKKDIILTQDLCEEFLNLNINKQDHAIELLKKELSPLLFEDSLSYLKE
                    ************************************************************

    COL             EFINEKLEFNISNLASRYTANNDVDTINNKIIEGISITNNFKYTNISYLKELKGYIENDI
    NCTC8325        EFINEKLEFNISNLASRYTANNDVDTINNKIIEGISITNNFKYTNISYLKELKGYIENDI
    Newman          EFINEKLEFNISNLASRYTANNDVDTINNKIIEGISITNNFKYTNISYLKELKGYIENDI
    USA300_FPR3757  EFINEKLEFNISNLASRYTANNDVDTINNKIIEGISITNNFKYTNISYLKELKGYIENDI
                    ************************************************************

    COL             LDKMKSKYAKNIYLNFKKIFSNLEQSVNNYLELEEEFEEKKKYLSEIYELIDEINIDPYI
    NCTC8325        LDKMKSKYAKNIYLNFKKIFSNLEQSVNNYLELEEEFEEKKKYLSEIYELIDEINIDPYI
    Newman          LDKMKSKYAKNIYLNFKKIFSNLEQSVNNYLELEEEFEEKKKYLSEIYELIDEINIDPYI
    USA300_FPR3757  LDKMKSKYAKNIYLNFKKIFSNLEQSVNNYLELEEEFEEKKKYLSEIYELIDEINIDPYI
                    ************************************************************

    COL             FLTEHNECNIYKISENEKLELQTYMSKIEKVLLKYQTYLKETCKECLFYPYLLVQGEAGI
    NCTC8325        FLTEHNECNIYKISENEKLELQTYMSKIEKVLLKYQTYLKETCKECLFYPYLLVQGEAGI
    Newman          FLTEHNECNIYKISENEKLELQTYMSKIEKVLLKYQTYLKETCKECLFYPYLLVQGEAGI
    USA300_FPR3757  FLTEHNECNIYKISENEKLELQTYMSKIEKVLLKYQTYLKETCKECLFYPYLLVQGEAGI
                    ************************************************************

    COL             GKSHLLAHLSKKLRDENHIIYLFLGQFFTKNEDPWHQILNDLEVTNSVDNFLRSISNKAK
    NCTC8325        GKSHLLAHLSKKLRDENHIIYLFLGQFFTKNEDPWHQILNDLEVTNSVDNFLRSISNKAK
    Newman          GKSHLLAHLSKKLRDENHIIYLFLGQFFTKNEDPWHQILNDLEVTNSVDNFLRSISNKAK
    USA300_FPR3757  GKSHLLAHLSKKLRDENHIIYLFLGQFFTKNEDPWHQILNDLEVTNSVDNFLRSISNKAK
                    ************************************************************

    COL             ETKKRAFIIIDALNEGEGKRLWGNYFQSFINHIKKYSNIALIFSIRTPFEDVILPKNAIQ
    NCTC8325        ETKKRAFIIIDALNEGEGKRLWGNYFQSFINHIKKYSNIALIFSIRTPFEDVILPKNAIQ
    Newman          ETKKRAFIIIDALNEGEGKRLWGNYFQSFINHIKKYSNIALIFSIRTPFEDVILPKNAIQ
    USA300_FPR3757  ETKKRAFIIIDALNEGEGKRLWGNYFQSFINHIKKYSNIALIFSIRTPFEDVILPKNAIQ
                    ************************************************************

    COL             DNNIVVFQHEGFSKEENYNPIVSFCDFYGLELPKLPILNPEFNNPLFLKLMCEYCVNKFK
    NCTC8325        DNNIVVFQHEGFSKEENYNPIVSFCDFYGLELPKLPILNPEFNNPLFLKLMCEYCVNKFK
    Newman          DNNIVVFQHEGFSKEENYNPIVSFCDFYGLELPKLPILNPEFNNPLFLKLMCEYCVNKFK
    USA300_FPR3757  DNNIVVFQHEGFSKEENYNPIVSFCDFYGLELPKLPILNPEFNNPLFLKLMCEYCVNKFK
                    ************************************************************

    COL             EFDQTISVAELFTNVLKTVNINLSKEDKFDFDKNINVVQKVIKGLVELMNDSEFNQLNYE
    NCTC8325        EFDQTISVAELFTNVLKTVNINLSKEDKFDFDKNINVVQKVIKGLVELMNDSEFNQLNYE
    Newman          EFDQTISVAELFTNVLKTVNINLSKEDKFDFDKNINVVQKVIKGLVELMNDSEFNQLNYE
    USA300_FPR3757  EFDQTISVAELFTNVLKTVNINLSKEDKFDFDKNINVVQKVIKGLVELMNDSEFNQLNYE
                    ************************************************************

    COL             ESYTVVNNIAKEYVQKSNRFLEALIDENILIKNTGYKGEMIIYFSYERMGDYFLSEYLLE
    NCTC8325        ESYTVVNNIAKEYVQKSNRFLEALIDENILIKNTGYKGEMIIYFSYERMGDYFLSEYLLE
    Newman          ESYTVVNNIAKEYVQKSNRFLEALIDENILIKNTGYKGEMIIYFSYERMGDYFLSEYLLE
    USA300_FPR3757  ESYTVVNNIAKEYVQKSNRFLEALIDENILIKNTGYKGEMIIYFSYERMGDYFLSEYLLE
                    ************************************************************

    COL             KYRNVDKRDLVTKLQSDEKVTRYFQKEDDLSYNRGLINELFIKLANEFNIELFEVFPQFK
    NCTC8325        KYRNVDKRDLVTKLQSDEKVTRYFQKEDDLSYNRGLINELFIKLANEFNIELFEVFPQFK
    Newman          KYRNVDKRDLVTKLQSDEKVTRYFQKEDDLSYNRGLINELFIKLANEFNIELFEVFPQFK
    USA300_FPR3757  KYRNVDKRDLVTKLQSDEKVTRYFQKEDDLSYNRGLINELFIKLANEFNIELFEVFPQFK
                    ************************************************************

    COL             NNYNMIYSFINSLVWRKDGSISKHTKCYISDNVIPYDAFRNNFLDVLLIKMPQKNHPLNI
    NCTC8325        NNYNMIYSFINSLVWRKDGSISKHTKCYISDNVIPYDAFRNNFLDVLLIKMPQKNHPLNI
    Newman          NNYNMIYSFINSLVWRKDGSISKHTKCYISDNVIPYDAFRNNFLDVLLIKMPQKNHPLNI
    USA300_FPR3757  NNYNMIYSFINSLVWRKDGSISKHTKCYISDNVIPYDAFRNNFLDVLLIKMPQKNHPLNI
                    ************************************************************

    COL             WALHKLLKQCNLGKRDFLWTQYISINNEKVFEIINWLFSNYKKLDEETAEKYMIFLTWIF
    NCTC8325        WALHKLLKQCNLGKRDFLWTQYISINNEKVFEIINWLFSNYKKLDEETAEKYMIFLTWIF
    Newman          WALHKLLKQCNLGKRDFLWTQYISINNEKVFEIINWLFSNYKKLDEETAEKYMIFLTWIF
    USA300_FPR3757  WALHKLLKQCNLGKRDFLWTQYISINNEKVFEIINWLFSNYKKLDEETAEKYMIFLTWIF
                    ************************************************************

    COL             SATNNKLRDLGTKSLVKLFKTFPTKIIGLLKLFENNNDPYIVERLYASVLGATLRIDICE
    NCTC8325        SATNNKLRDLGTKSLVKLFKTFPTKIIGLLKLFENNNDPYIVERLYASVLGATLRIDICE
    Newman          SATNNKLRDLGTKSLVKLFKTFPTKIIGLLKLFENNNDPYIVERLYASVLGATLRIDICE
    USA300_FPR3757  SATNNKLRDLGTKSLVKLFKTFPTKIIGLLKLFENNNDPYIVERLYASVLGATLRIDICE
                    ************************************************************

    COL             IHIEIANYIYEEIFDKEMVYPHILMRDYARQTIEYISLSKDISNINLEKIRPPYKSNWYK
    NCTC8325        IHIEIANYIYEEIFDKEMVYPHILMRDYARQTIEYISLSKDISNINLEKIRPPYKSNWYK
    Newman          IHIEIANYIYEEIFDKEMVYPHILMRDYARQTIEYISLSKDISNINLEKIRPPYKSNWYK
    USA300_FPR3757  IHIEIANYIYEEIFDKEMVYPHILMRDYARQTIEYISLSKDISNINLEKIRPPYKSNWYK
                    ************************************************************

    COL             KEYSNLNIDDYIKSLKNKLDSHLHFSIDKIKNSMTTEYGRGTGAYGDFGRYVFGYAVRNW
    NCTC8325        KEYSNLNIDDYIKSLKNKLDSHLHFSIDKIKNSMTTEYGRGTGAYGDFGRYVFGYAVRNW
    Newman          KEYSNLNIDDYIKSLKNKLDSHLHFSIDKIKNSMTTEYGRGTGAYGDFGRYVFGYAVRNW
    USA300_FPR3757  KEYSNLNIDDYIKSLKNKLDSHLHFSIDKIKNSMTTEYGRGTGAYGDFGRYVFGYAVRNW
                    ************************************************************

    COL             VKGFKSDQDLSNIALMRIFEMGYDAKLHGEFDMWVNRYDNFNNSIERISKKYQWIAYYEI
    NCTC8325        VKGFKSDQDLSNIALMRIFEMGYDAKLHGEFDMWVNRYDNFNNSIERISKKYQWIAYYEI
    Newman          VKGFKSDQDLSNIALMRIFEMGYDAKLHGEFDMWVNRYDNFNNSIERISKKYQWIAYYEI
    USA300_FPR3757  VKGFKSDQDLSNIALMRIFEMGYDAKLHGEFDMWVNRYDNFNNSIERISKKYQWIAYYEI
                    ************************************************************

    COL             LAKLVDKFPDVQYSGLWDDYIRDIDPTLLLLEIDK-------------------------
    NCTC8325        LAKLVDKFPDVQYSGLWDDYIRDIDPTLLLLEIDKESKILVPSPLPSHQSNEWVKNTKVF
    Newman          LAKLVDKFPDVQYSGLWDDYIRDIDPTLLLLEIDK-------------------------
    USA300_FPR3757  LAKLVDKFPDVQYSGLWDDYIRDIDPTLLLLEIDKESKILVPSPLPSHQSNEWVKNTKVF
                    ***********************************                         

    COL             ------------------------------------------------------------
    NCTC8325        DETKLFLEIDIDNHRYICLSSKFNFEKREKEIPFEDRDSCYFLAMGYFYNKEDSNEIIKG
    Newman          ------------------------------------------------------------
    USA300_FPR3757  DETKLFLEIDIDNHRYICLSSKFNFEKREKEIPFEDRDSCYFLAMGYFYNKEDSNEIIKG
                                                                                

    COL             ------------------------------------------------------------
    NCTC8325        YENNYDRGINIPRAHSIYLYEYYWSEAYKNYKEGYLTESDGKLCPAIYEYFWELDYSVKD
    Newman          ------------------------------------------------------------
    USA300_FPR3757  YENNYDRGINIPRAHSIYLYEYYWSEAYKNYKEGYLTESDGKLCPAIYEYFWELDYSVKD
                                                                                

    COL             ------------------------------------------------------------
    NCTC8325        KSISFYIPCKEIVDYFSLIQTEEGVWKTKFGETICINSKLLEFDNECLLIKKESLLNFLN
    Newman          ------------------------------------------------------------
    USA300_FPR3757  KSISFYIPCKEIVDYFSLIQTEEGVWKTKFGETICINSKLLEFDNECLLIKKESLLNFLN
                                                                                

    COL             --------------------------------------------------
    NCTC8325        TKKLSIGWKIYLEKISLRDRQEWWYNVFYDDGKYNKKIIKNDMSKIRRNF
    Newman          --------------------------------------------------
    USA300_FPR3757  TKKLSIGWKIYLEKISLRDRQEWWYNVFYDDGKYNKKIIKNDMSKIRRNF
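
In the alignment above, the COL and Newman sequences lack both the N-terminal and the C-terminal regions present in NCTC8325 and USA300_FPR3757. A minimal Python sketch of how such differences could be quantified from CLUSTAL-style text is given below. It assumes sequence names contain no spaces (true for the four strains aligned here) and uses a two-sequence excerpt of one block for illustration. The function names are hypothetical, and a dedicated parser such as Biopython's `AlignIO` may be preferable for real use.

```python
def parse_clustal(text):
    """Concatenate the per-block rows of a CLUSTAL-style alignment by name."""
    seqs = {}
    for line in text.splitlines():
        parts = line.split()
        # Sequence rows have exactly two fields (name, residues); the header,
        # blank lines, and conservation rows ('*', ':', '.') are skipped.
        if len(parts) != 2 or set(parts[1]) <= set("*:."):
            continue
        name, chunk = parts
        seqs[name] = seqs.get(name, "") + chunk
    return seqs


def ungapped_lengths(seqs):
    """Number of residues per sequence, ignoring gap characters."""
    return {name: len(s.replace("-", "")) for name, s in seqs.items()}


def identical_columns(seqs):
    """Count gap-free columns in which all sequences share the same residue."""
    cols = zip(*seqs.values())
    return sum(1 for col in cols if "-" not in col and len(set(col)) == 1)


# Excerpt of one alignment block (COL and NCTC8325 rows only).
EXCERPT = "\n".join([
    "CLUSTAL format alignment by MAFFT L-INS-i (v7.307)",
    "",
    "COL             LAKLVDKFPDVQYSGLWDDYIRDIDPTLLLLEIDK" + "-" * 25,
    "NCTC8325        LAKLVDKFPDVQYSGLWDDYIRDIDPTLLLLEIDKESKILVPSPLPSHQSNEWVKNTKVF",
    "                " + "*" * 35,
])

seqs = parse_clustal(EXCERPT)
print(ungapped_lengths(seqs))   # {'COL': 35, 'NCTC8325': 60}
print(identical_columns(seqs))  # 35
```

Applied to the full alignment text, the same functions would report the total residue count per strain and the number of fully conserved columns.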