Longibrachin LGA IV

Details

Top
Internal ID 933c339d-1e84-4889-bcbb-d0880678cff3
Taxonomy Organic Polymers > Polypeptides
IUPAC Name (2S)-2-[(3S)-3-[[2-[[(2S)-2-[[(2S)-1-[2-[[(2S)-2-[[2-[[2-[[(2R)-2-[[2-[[(2R)-2-[[2-[[2-[[(2R)-2-[[2-[[(2S)-2-[(2-acetamido-2-methylpropanoyl)amino]propanoyl]amino]-2-methylpropanoyl]amino]propanoyl]amino]-2-methylpropanoyl]amino]-2-methylpropanoyl]amino]-5-amino-5-oxopentanoyl]amino]-2-methylpropanoyl]amino]-3-methylbutanoyl]amino]-2-methylpropanoyl]amino]acetyl]amino]-3-methylbutanoyl]amino]-2-methylpropanoyl]pyrrolidine-2-carbonyl]amino]-3-methylbutanoyl]amino]-2-methylpropanoyl]amino]-3-methyl-2-oxopentyl]-N-[(2R)-5-amino-1-[[(2S)-1-hydroxy-3-phenylpropan-2-yl]amino]-1,5-dioxopentan-2-yl]pentanediamide
SMILES (Canonical) CCC(C)(C(=O)CC(CCC(=O)N)C(=O)NC(CCC(=O)N)C(=O)NC(CC1=CC=CC=C1)CO)NC(=O)C(C)(C)NC(=O)C(C(C)C)NC(=O)C2CCCN2C(=O)C(C)(C)NC(=O)C(C(C)C)NC(=O)CNC(=O)C(C)(C)NC(=O)C(C(C)C)NC(=O)C(C)(C)NC(=O)C(CCC(=O)N)NC(=O)C(C)(C)NC(=O)C(C)(C)NC(=O)C(C)NC(=O)C(C)(C)NC(=O)C(C)NC(=O)C(C)(C)NC(=O)C
SMILES (Isomeric) CC[C@@](C)(C(=O)C[C@H](CCC(=O)N)C(=O)N[C@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)CO)NC(=O)C(C)(C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]2CCCN2C(=O)C(C)(C)NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)C(C)(C)NC(=O)[C@@H](C(C)C)NC(=O)C(C)(C)NC(=O)[C@@H](CCC(=O)N)NC(=O)C(C)(C)NC(=O)C(C)(C)NC(=O)[C@@H](C)NC(=O)C(C)(C)NC(=O)[C@H](C)NC(=O)C(C)(C)NC(=O)C
InChI InChI=1S/C92H152N22O24/c1-28-92(27,59(117)44-54(36-39-60(93)118)69(124)100-56(37-40-61(94)119)70(125)99-55(46-115)43-53-33-30-29-31-34-53)113-82(137)90(23,24)110-74(129)65(48(4)5)103-72(127)58-35-32-42-114(58)83(138)91(25,26)111-73(128)64(47(2)3)102-63(121)45-96-76(131)84(11,12)109-75(130)66(49(6)7)104-80(135)87(17,18)108-71(126)57(38-41-62(95)120)101-79(134)88(19,20)112-81(136)89(21,22)107-68(123)51(9)98-78(133)86(15,16)106-67(122)50(8)97-77(132)85(13,14)105-52(10)116/h29-31,33-34,47-51,54-58,64-66,115H,28,32,35-46H2,1-27H3,(H2,93,118)(H2,94,119)(H2,95,120)(H,96,131)(H,97,132)(H,98,133)(H,99,125)(H,100,124)(H,101,134)(H,102,121)(H,103,127)(H,104,135)(H,105,116)(H,106,122)(H,107,123)(H,108,126)(H,109,130)(H,110,129)(H,111,128)(H,112,136)(H,113,137)/t50-,51+,54-,55-,56+,57+,58-,64-,65-,66+,92-/m0/s1
InChI Key IZAHAERNXKHNJZ-PWVGHYQSSA-N
Popularity 1 reference in papers

Physical and Chemical Properties

Top
Molecular Formula C92H152N22O24
Molecular Weight 1950.30 g/mol
Exact Mass 1949.13498380 g/mol
Topological Polar Surface Area (TPSA) 711.00 Ų
XlogP -2.20
Atomic LogP (AlogP) -3.95
H-Bond Acceptor 24
H-Bond Donor 22
Rotatable Bonds 55

Synonyms

Top
There are no found synonyms.

2D Structure

Top
2D Structure of Longibrachin LGA IV

3D Structure

Top

ADMET Properties (via admetSAR 2)

Top
Target Value Probability (raw) Probability (%)
Human Intestinal Absorption + 0.9071 90.71%
Caco-2 - 0.8586 85.86%
Blood Brain Barrier - 0.5750 57.50%
Human oral bioavailability - 0.6143 61.43%
Subcellular localzation Lysosomes 0.5709 57.09%
OATP2B1 inhibitior - 1.0000 100.00%
OATP1B1 inhibitior + 0.8315 83.15%
OATP1B3 inhibitior + 0.9399 93.99%
MATE1 inhibitior - 0.9012 90.12%
OCT2 inhibitior - 0.8000 80.00%
BSEP inhibitior + 0.9531 95.31%
P-glycoprotein inhibitior + 0.7419 74.19%
P-glycoprotein substrate + 0.8593 85.93%
CYP3A4 substrate + 0.7454 74.54%
CYP2C9 substrate - 0.6048 60.48%
CYP2D6 substrate - 0.8313 83.13%
CYP3A4 inhibition + 0.7767 77.67%
CYP2C9 inhibition - 0.8452 84.52%
CYP2C19 inhibition - 0.6379 63.79%
CYP2D6 inhibition - 0.7933 79.33%
CYP1A2 inhibition - 0.8870 88.70%
CYP2C8 inhibition + 0.7352 73.52%
CYP inhibitory promiscuity - 0.8665 86.65%
UGT catelyzed + 0.7000 70.00%
Carcinogenicity (binary) - 0.7800 78.00%
Carcinogenicity (trinary) Non-required 0.6492 64.92%
Eye corrosion - 0.9887 98.87%
Eye irritation - 0.8954 89.54%
Skin irritation - 0.7921 79.21%
Skin corrosion - 0.8863 88.63%
Ames mutagenesis - 0.6878 68.78%
Human Ether-a-go-go-Related Gene inhibition + 0.7155 71.55%
Micronuclear + 0.6600 66.00%
Hepatotoxicity + 0.5657 56.57%
skin sensitisation - 0.8783 87.83%
Respiratory toxicity + 0.8333 83.33%
Reproductive toxicity + 0.8556 85.56%
Mitochondrial toxicity + 0.8750 87.50%
Nephrotoxicity - 0.6521 65.21%
Acute Oral Toxicity (c) III 0.6709 67.09%
Estrogen receptor binding - 0.5849 58.49%
Androgen receptor binding + 0.7577 75.77%
Thyroid receptor binding + 0.7751 77.51%
Glucocorticoid receptor binding + 0.8399 83.99%
Aromatase binding + 0.8132 81.32%
PPAR gamma + 0.8125 81.25%
Honey bee toxicity - 0.7200 72.00%
Biodegradation - 0.8750 87.50%
Crustacea aquatic toxicity + 0.5400 54.00%
Fish aquatic toxicity + 0.7037 70.37%

Targets

Top

Proven Targets:

CHEMBL ID UniProt ID Name Min activity Assay type Source
No proven targets yet!

Predicted Targets (via Super-PRED):

CHEMBL ID UniProt ID Name Probability Model accuracy
CHEMBL2581 P07339 Cathepsin D 99.96% 98.95%
CHEMBL1795139 Q8IU80 Transmembrane protease serine 6 98.52% 98.33%
CHEMBL220 P22303 Acetylcholinesterase 98.33% 94.45%
CHEMBL1914 P06276 Butyrylcholinesterase 98.10% 95.00%
CHEMBL3251 P19838 Nuclear factor NF-kappa-B p105 subunit 98.03% 96.09%
CHEMBL2693 Q9UIQ6 Cystinyl aminopeptidase 97.46% 97.64%
CHEMBL4588 P22894 Matrix metalloproteinase 8 97.32% 94.66%
CHEMBL4979 P13866 Sodium/glucose cotransporter 1 95.32% 98.24%
CHEMBL4026 P40763 Signal transducer and activator of transcription 3 95.31% 82.69%
CHEMBL3359 P21462 Formyl peptide receptor 1 94.24% 93.56%
CHEMBL4394 Q9NYA1 Sphingosine kinase 1 94.07% 96.03%
CHEMBL2514 O95665 Neurotensin receptor 2 93.85% 100.00%
CHEMBL221 P23219 Cyclooxygenase-1 93.84% 90.17%
CHEMBL1907591 P30926 Neuronal acetylcholine receptor; alpha4/beta4 93.23% 100.00%
CHEMBL2664 P23526 Adenosylhomocysteinase 92.73% 86.67%
CHEMBL4478 Q00975 Voltage-gated N-type calcium channel alpha-1B subunit 92.51% 97.14%
CHEMBL3807 P17706 T-cell protein-tyrosine phosphatase 92.37% 93.00%
CHEMBL340 P08684 Cytochrome P450 3A4 91.51% 91.19%
CHEMBL321 P14780 Matrix metalloproteinase 9 90.91% 92.12%
CHEMBL3137262 O60341 LSD1/CoREST complex 90.80% 97.09%
CHEMBL4227 P25090 Lipoxin A4 receptor 90.04% 100.00%
CHEMBL3837 P07711 Cathepsin L 88.99% 96.61%
CHEMBL4018 P49146 Neuropeptide Y receptor type 2 88.86% 98.94%
CHEMBL333 P08253 Matrix metalloproteinase-2 88.30% 96.31%
CHEMBL5619 P27695 DNA-(apurinic or apyrimidinic site) lyase 87.82% 91.11%
CHEMBL3060 Q9Y345 Glycine transporter 2 87.39% 99.17%
CHEMBL3130 O00329 PI3-kinase p110-delta subunit 87.31% 96.47%
CHEMBL5701 Q9H2K8 Serine/threonine-protein kinase TAO3 86.93% 96.67%
CHEMBL5028 O14672 ADAM10 86.31% 97.50%
CHEMBL1907594 P30926 Neuronal acetylcholine receptor; alpha3/beta4 86.26% 97.23%
CHEMBL3108638 O15164 Transcription intermediary factor 1-alpha 86.16% 95.56%
CHEMBL5608 Q16288 NT-3 growth factor receptor 85.51% 95.89%
CHEMBL5043 Q6P179 Endoplasmic reticulum aminopeptidase 2 84.88% 91.81%
CHEMBL1873 P00750 Tissue-type plasminogen activator 84.36% 93.33%
CHEMBL3430907 Q96GD4 Aurora kinase B/Inner centromere protein 84.34% 97.50%
CHEMBL4203 Q9HAZ1 Dual specificity protein kinase CLK4 83.91% 94.45%
CHEMBL5261 Q7L7X3 Serine/threonine-protein kinase TAO1 83.65% 89.33%
CHEMBL2274 Q9H228 Sphingosine 1-phosphate receptor Edg-8 83.49% 100.00%
CHEMBL4657 Q6V1X1 Dipeptidyl peptidase VIII 83.32% 97.21%
CHEMBL3004 P33527 Multidrug resistance-associated protein 1 81.98% 96.37%
CHEMBL1075094 Q16236 Nuclear factor erythroid 2-related factor 2 81.72% 96.00%
CHEMBL6007 O75762 Transient receptor potential cation channel subfamily A member 1 81.54% 92.17%
CHEMBL1944495 P28065 Proteasome subunit beta type-9 81.13% 97.50%
CHEMBL2535 P11166 Glucose transporter 80.43% 98.75%
CHEMBL3892 Q99500 Sphingosine 1-phosphate receptor Edg-3 80.20% 97.29%

Plants that contains it

Top
Below are displayed all the plants proven (via scientific papers) to contain this compound!
To see more specific details click the taxa you are interested in.
There are no matching plants.

Cross-Links

Top
PubChem 163104502
LOTUS LTS0227927
wikiData Q105123087