Podcast
Questions and Answers
Which of the following is NOT a direct step in the drug discovery process as described in the provided content?
Which of the following is NOT a direct step in the drug discovery process as described in the provided content?
- Activity model generation
- Clinical studies
- Lead identification
- Searching for natural products (correct)
What is the primary purpose of 'structural and physicochemical filters' during the drug discovery process?
What is the primary purpose of 'structural and physicochemical filters' during the drug discovery process?
- To identify the interaction place of a drug with a target
- To optimize the activity profile of a lead compound
- To assess the metabolic and toxic aspects of a drug
- To evaluate the drug likeness of a ligand (correct)
According to the content, what is a key aspect considered during the 'docking' process?
According to the content, what is a key aspect considered during the 'docking' process?
- The geometry between the ligand and its receptor (correct)
- The ligand's similarity to other known compounds
- The ADMET properties of the compound
- The chemical stability of the compound
Which activity is NOT part of the post-processing steps after docking?
Which activity is NOT part of the post-processing steps after docking?
What is the main objective of 'SAR' in the context of drug discovery, based on the content?
What is the main objective of 'SAR' in the context of drug discovery, based on the content?
What does 'ADMET' refer to in the post-processing stage of drug discovery?
What does 'ADMET' refer to in the post-processing stage of drug discovery?
Which of the following is a challenge associated with virtual drug design as described in the content?
Which of the following is a challenge associated with virtual drug design as described in the content?
Based on the text, what does a 'pharmacophore identification' help to determine?
Based on the text, what does a 'pharmacophore identification' help to determine?
What is the primary goal when determining sample size?
What is the primary goal when determining sample size?
Which of these MUST be true for a sample to be considered representative?
Which of these MUST be true for a sample to be considered representative?
What does sample randomness primarily ensure?
What does sample randomness primarily ensure?
In statistical analysis, what is the term for values computed for the population?
In statistical analysis, what is the term for values computed for the population?
Mathematical statistics is primarily divided into which two branches?
Mathematical statistics is primarily divided into which two branches?
Which of the following best describes the purpose of analysis of variance?
Which of the following best describes the purpose of analysis of variance?
When categorizing sample sizes, what range defines a 'large sample'?
When categorizing sample sizes, what range defines a 'large sample'?
What does 'a statistical unit' refer to?
What does 'a statistical unit' refer to?
In a Winner Takes All (WTA) neural network, what is the key characteristic regarding neuron activation?
In a Winner Takes All (WTA) neural network, what is the key characteristic regarding neuron activation?
What is the distinguishing feature of a Winner Takes Most (WTM) network compared to a WTA network?
What is the distinguishing feature of a Winner Takes Most (WTM) network compared to a WTA network?
What is the primary purpose of using a neighborhood function in a WTM network?
What is the primary purpose of using a neighborhood function in a WTM network?
In the context of QSAR, what molecular property is often projected onto a 2D topographic map using CoMSA?
In the context of QSAR, what molecular property is often projected onto a 2D topographic map using CoMSA?
What kind of data is entered into the Kohonen network when creating a CoMSA representation, regarding the molecule's surface?
What kind of data is entered into the Kohonen network when creating a CoMSA representation, regarding the molecule's surface?
What is done with the electrostatic potential values after mapping them to neurons in the CoMSA method?
What is done with the electrostatic potential values after mapping them to neurons in the CoMSA method?
What is the purpose of superimposing 3D structures in the CoMSA method?
What is the purpose of superimposing 3D structures in the CoMSA method?
In the context of 3D-QSAR, how is the reverse image of a receptor characterized?
In the context of 3D-QSAR, how is the reverse image of a receptor characterized?
What is the primary purpose of the centering operation described?
What is the primary purpose of the centering operation described?
What effect does the centering operation have on the distribution of samples in m-dimensional space?
What effect does the centering operation have on the distribution of samples in m-dimensional space?
In the standardization formula, what does std(x_j)
represent?
In the standardization formula, what does std(x_j)
represent?
What is the key outcome of standardizing the data?
What is the key outcome of standardizing the data?
Given a dataset where the mean of a column’s values is $5$, how are the values changed during a centering operation?
Given a dataset where the mean of a column’s values is $5$, how are the values changed during a centering operation?
Before standardization, if two columns in a dataset have different standard deviations, what issue would this cause in data analysis?
Before standardization, if two columns in a dataset have different standard deviations, what issue would this cause in data analysis?
If a dataset is centered, but not standardized, what can be said about the contribution of its variables to the total variance?
If a dataset is centered, but not standardized, what can be said about the contribution of its variables to the total variance?
Which of the following is NOT a typical use of descriptive statistics?
Which of the following is NOT a typical use of descriptive statistics?
Which of the following best describes the role of hypothesis testing?
Which of the following best describes the role of hypothesis testing?
In hypothesis testing, a one-sample problem involves making inferences about:
In hypothesis testing, a one-sample problem involves making inferences about:
When conducting a hypothesis test, the hypotheses considered are formulated as:
When conducting a hypothesis test, the hypotheses considered are formulated as:
Which statistical test is appropriate for comparing the means of two independent groups?
Which statistical test is appropriate for comparing the means of two independent groups?
What needs consideration when using the Student's t-test?
What needs consideration when using the Student's t-test?
If there are more than two groups being compared, which type of test should be used?
If there are more than two groups being compared, which type of test should be used?
In the given example, after calculating the t-statistic as 2.6789 and finding the critical value of t from the table as 2.16, what is the conclusion of the hypothesis test, if the significance level ($\alpha$) is equal to 0.05?
In the given example, after calculating the t-statistic as 2.6789 and finding the critical value of t from the table as 2.16, what is the conclusion of the hypothesis test, if the significance level ($\alpha$) is equal to 0.05?
In calculating the standard error of the difference between two means, the degrees of freedom are calculated as:
In calculating the standard error of the difference between two means, the degrees of freedom are calculated as:
In graph theory, what do 'V' and 'E' typically represent?
In graph theory, what do 'V' and 'E' typically represent?
What is a neighboring matrix (adjacency matrix) used for in the context of molecular graphs?
What is a neighboring matrix (adjacency matrix) used for in the context of molecular graphs?
Which of the following is NOT a typical component of a connection table (CT) used in chemistry?
Which of the following is NOT a typical component of a connection table (CT) used in chemistry?
What is the purpose of a molecular editor in the context of computational chemistry?
What is the purpose of a molecular editor in the context of computational chemistry?
What does the acronym SMILES stand for?
What does the acronym SMILES stand for?
Which of the following SMILES notations represents a double bond?
Which of the following SMILES notations represents a double bond?
What does 'CANGEN' refer to, within the context of SMILES notation?
What does 'CANGEN' refer to, within the context of SMILES notation?
In SMILES notation, what do square brackets ( []
) generally indicate?
In SMILES notation, what do square brackets ( []
) generally indicate?
Flashcards
Drug Graph
Drug Graph
A graph representation of a drug, where vertices represent atoms and edges represent bonds.
Adjacency Matrix
Adjacency Matrix
A matrix that encodes the connections between atoms in a molecule. Each row and column represents an atom, and a '1' indicates a bond between the corresponding atoms.
Connection Table
Connection Table
A table that lists all the atoms in a molecule and their connections, including bond order and atom type.
SMILES (Simplified Molecular Input Line Entry System)
SMILES (Simplified Molecular Input Line Entry System)
Signup and view all the flashcards
MOL File
MOL File
Signup and view all the flashcards
Molecular Editor
Molecular Editor
Signup and view all the flashcards
SMILES to 3D Conversion
SMILES to 3D Conversion
Signup and view all the flashcards
Molecular Representation
Molecular Representation
Signup and view all the flashcards
Virtual Drug Screening
Virtual Drug Screening
Signup and view all the flashcards
QSAR (Quantitative Structure-Activity Relationship)
QSAR (Quantitative Structure-Activity Relationship)
Signup and view all the flashcards
Lead Optimization
Lead Optimization
Signup and view all the flashcards
Pharmacophore
Pharmacophore
Signup and view all the flashcards
Drug Likeness Filters
Drug Likeness Filters
Signup and view all the flashcards
Disease Specification
Disease Specification
Signup and view all the flashcards
Lead Identification
Lead Identification
Signup and view all the flashcards
Clinical Studies
Clinical Studies
Signup and view all the flashcards
Representative sample
Representative sample
Signup and view all the flashcards
Representative method
Representative method
Signup and view all the flashcards
Random sampling
Random sampling
Signup and view all the flashcards
Sample size
Sample size
Signup and view all the flashcards
Statistic
Statistic
Signup and view all the flashcards
Parameter
Parameter
Signup and view all the flashcards
Estimation theory
Estimation theory
Signup and view all the flashcards
Hypothesis testing
Hypothesis testing
Signup and view all the flashcards
Winner-Takes-All (WTA)
Winner-Takes-All (WTA)
Signup and view all the flashcards
Winner-Takes-Most (WTM)
Winner-Takes-Most (WTM)
Signup and view all the flashcards
Neighborhood Function
Neighborhood Function
Signup and view all the flashcards
Molecular Surface
Molecular Surface
Signup and view all the flashcards
Electrostatic Potential Map (MEP)
Electrostatic Potential Map (MEP)
Signup and view all the flashcards
CoMSA (Comparative Molecular Surface Analysis)
CoMSA (Comparative Molecular Surface Analysis)
Signup and view all the flashcards
Kohonen Reference Map
Kohonen Reference Map
Signup and view all the flashcards
Drug Representation
Drug Representation
Signup and view all the flashcards
Data Centering
Data Centering
Signup and view all the flashcards
Data Standardization
Data Standardization
Signup and view all the flashcards
Arithmetic Mean
Arithmetic Mean
Signup and view all the flashcards
Descriptive Statistics
Descriptive Statistics
Signup and view all the flashcards
Molecular Descriptors
Molecular Descriptors
Signup and view all the flashcards
Lipophilicity
Lipophilicity
Signup and view all the flashcards
Computer-Aided Drug Design (CAMD)
Computer-Aided Drug Design (CAMD)
Signup and view all the flashcards
In Silico Medicine
In Silico Medicine
Signup and view all the flashcards
Null hypothesis
Null hypothesis
Signup and view all the flashcards
Alternative hypothesis
Alternative hypothesis
Signup and view all the flashcards
Student's t-test for independent groups
Student's t-test for independent groups
Signup and view all the flashcards
Student's t-test for dependent/paired data
Student's t-test for dependent/paired data
Signup and view all the flashcards
Analysis of Variance (ANOVA)
Analysis of Variance (ANOVA)
Signup and view all the flashcards
Degrees of freedom (df)
Degrees of freedom (df)
Signup and view all the flashcards
P-value
P-value
Signup and view all the flashcards
Study Notes
Contact Information
- Professor: Dr. Andrzej Bąk
- Email: [email protected], [email protected]
- Phone: 32 359 11 97
- Address: UI. Szkolna 9, Katowice, room 40, I floor
In-silico Medicine Outline
- Data, informatics, knowledge in medicine
- Coding of drug structure – molecular editors
- Receptors as molecular targets
- Molecular descriptors – lipophilicity
- Computer Assisted Molecular Design (CAMD) – QSAR modeling
- Artificial intelligence in medicine (AI) – essentials of neural networks (NN)
- Basics of biostatistics
- Data preprocessing
- Basic procedures of data modeling
- Essential techniques of data visualization and data grouping
Literature of Subject
- Edited by Thomas Engel and Johann Gasteiger
- Chemoinformatics: Basic Concepts and Methods
- Textbook of Medical Statistics (for Medical Students, editors Xiuhua Guo and Fuzhong Xue)
- Introduction to Medicinal Chemistry (seventh edition, by Graham L. Patrick)
- Wold Wide Web database
Computer Assisted Drug Design (CADD)
- AIM – production of substances that act in the human body →rational drug production
- Molecular production of properties → drug modeling
Medicine
- Medicine – the art of human treatment/healing
- Pharmacy/Pharmacology/Medicinal Chemistry: The science of molecules and their transformations
- Drug Molecule: A molecule with medicinal properties
- Paracelsus XVI century quote – “All things are poison, and nothing is without poison, only the dose permits something not be poisonous.”
History of Drug Discovery
- 5100 years ago – China: Chiang Shan herbs (fever), Ma Huang (anti-cough drug)
- 3 w.p.n.e.: Extract from poppy grain (pain-killers), scopolamine (truth serum), Belladona (atropine)
- Inca nations: coca (energizing)
- India: rezorcinol (high blood pressure)
- 1633: Calancha monk – extract from quina tree (drug to malaria)
- 1928: Alexander Fleming – penicillin (antibiotics, antybiotos)
- “Sometimes you find what you are not looking for”
- “The Nature produced penicilline, I have just discovered it”
Accident = Serendipity - "lucky shot"
- Engineer in the military health service
- Study of biological discharge to kill bacteria
- Lisozyme (1921) - first antibiotics
- Penicillin (1928): fungus (Penicillium notatum) → Staphylococcus
- 1941: Florey and Chain – experiment on mice (infected with streptococcus)
- 1941: Albert Aleksander – patient 0 (experiment with thorn of rose) and patient survival despite death
- Karl Folkers (Merk) – specification of Penicillin structure
Medicinal Product
- Directive 2001/83/EC of the European Parliament and the Council (November 6, 2001) → any substance or mixture of substances intended for diseases in humans or animals, and substance/mixture of substances used for diagnosis or restoring, correcting or modifying physiological functions
Classification of Drug Molecules
- Natural compounds - materials from plants and animals (e.g. vitamins, hormones, amino acids, antibiotics, alkaloids, glycosides)
- Synthesis compounds - pure synthesis or synthesis from naturally occurring compounds (e.g. morphine, atropine, steroids, cocaine
- Semi-synthesis compounds - compounds that cannot be purely synthesized or isolated from natural sources inexpensively thus using natural intermediates for desired product (e.g. semi-synthetic penicillins)
Data Exploration
- Graph of compounds published in CAS, rising exponentially since 1965
- Graph relating number of new drugs & R&D spending exhibiting a negative trend correlating to log-scale graph exhibiting an exponential negative trend
Why new drugs are so expensive?
- Absorption
- Distribution
- Metabolism
- Excretion
- Toxicity
- ADMET
Drug Analysis
- 2D → 3D molecular generators (CORINA, MOPAC, BABEL, SYBYL, MOE, etc.) → input data
- Pre-processing/descriptor selection (scaling, centering, standardization)
- Structure coding
- Statistical methods
- Artificial intelligence (neural networks)
- Data analysis
Chemoinformatics of Drugs
- Molecule
- Structure
- Descriptors
- Data Analysis
- Database Exploration
- Measures
From data to Drugs
- Chemoinformatics (pharmacophore, serendipity)
- Bioinformatics (de novo)
- Similarity searching
- Molecular docking
Two Directions of Drug Design
- Ligand-based Drug Discovery (RI): ligand collection → maps of interactions → reverse image of target structure
- CC&HTS → lead structure → optimization → clinical studies
- Structure-based Drug Discovery (RD): searching for drugs of low molecular weight as complementary fulfillment of target molecule
- Identification of target → preparation of target molecule → docking → assessment of bioactivity
Two Directions of CAMD Study
- Preparation of receptor and ligand
- Structural and physicochemical filters
- Drug likeness
- Docking
- Ligand-receptor geometry
- Post processing
- Scoring/Rescoring
- Variable selection
- Activity prediction
- Model assessment
- Pose evaluation
- ADMET
- Chemical stability
Drug → from idea to implementation
- Disease specification → Optimization of Lead
- Determination of interaction place → Identifying pharmacophore
- Selection of biological test → Identification of Lead
-
Metabolic and toxic studies → Clinical studies → Final Drug Confirmation
Directions of future evolution in drug design and medicine
Searching for synthetic analogues of natural products NP → Searching for leading structures of selected biological targets → Personalized Drug Design
Virtual Drug Coding Problems
- Storing molecular structure in a database
- Searching structures in a database
- Deriving structures from measured data
- Predicting properties for a structure
- Treating chemical reactions and synthesis planning
Chemical presentation of drugs.
- 3D model
- Model 2D
- Nomenclature (Chemical names)
- Content (Chemical structures)
- Information structure
Racemic Mixtures Stereochemistry
- Oral medication for cancers (e.g. multiple myeloma) and skin disorders
- Human teratogen with life-threatening birth defects during pregnancy
- Skeletal deformities (amelia, absence of legs/arms); phocomelia (limb malformation)
- (R)-enantiomer → sedative effect and (S)-enantiomer → embryo-toxic and teratogenic effect.
- Enantiomers can racemize to each other in vivo
Linear notation in drug coding
- Trivial name: Phenylalanine
- IUPAC name: 2-amino-3-phenylpropanoic acid
- WLN: VQYZ1R
- ROSDAL: 10-2=30,2-4-5N,4-6-7=-12-7
- Smiles: NC(Cc1ccccc1)C(O)=O
- SLN: C[1]H:CH:CH:CH:CH:C(:@1)CH2CH(NH2)C(=O)OH
Molecular surfaces in drug design
- Van der Waals' surface
- Connolly's surface
Graph theory
- L. Euler (1736) → Königsberg’s seven bridges problem
- Molecular graphs G={V,E}
- V = vertexes (atoms), E = edges (bonds)
- Drug – undirected coherent graph
Coding of Drug Constitution
- Neighboring matrix/bonding matrix
- Connection table CT
Drug in MDL Molfile format
- Compound information and properties, formatted as a text file for importing and using chemical compounds
Molecular Editors
- Molecule editor
- Viewer
- Connection table
Survey of molecular editors
- Various software tools (Chem3D, ChemDraw, ACD/ChemSketch, ISIS/Draw, etc.) for molecular structure drawing, analysis, and manipulation
SMILES (Simplified Molecular Input Line Entry System)
- Entering molecular structure using a set of rules;
- Generating unique representations
Statistical Metrics
- Arithmetic mean
- Geometric mean
- Median
- Standard deviation
- Variance
- Kurtosis
- Skewness
Basic parameters of statistical data
- Variance and standard deviation
Scaling, centering, standardization
- Formulas for scaling, centering, and standardization of data
Estimation in Statistics
- Estimation of the mean of a distribution
- Point estimation
- Interval estimation (t distribution)
Hypothesis Testing
- One-sample t-test for the mean of normal distribution with unknown variance
Two-Sample t Test for independent samples with equal variances
- Test statistic formula
Types of Statistics
- Descriptive statistics: Continuous, discrete, proportions, central tendency, dispersion
- Inferential statistics: Estimation, hypothesis testing, p-values, confidence intervals
Planning/Design of the Study
- Primary question
- Sample size
- Participant selection
- Group assignment
Biomedical research
- Relationship between patient characteristics/treatments and a health condition
- Establishing cause and effect
Errors
- Nonsampling error
- Sampling error:
- Error of measurement
- Error of non-response
- Error of misinterpretation
- Error of calculation or arithmetical error
- Error of sampling bias
Statistical Community
- One-dimensional versus Multidimensional
- Static versus Dynamic
- Relatively homogeneous versus Heterogeneous
- Qualitative versus Quantitative
Populations and Samples
- Population
- Sample
- Representative sample
- Random sample
Regression and Correlation Methods
- Relate a normally distributed output variable
- Types of correlation: Linear positive, linear negative, non-linear
- Pearson linear correlation coefficient
ANOVA
- Hypotheses for comparing three+ groups
- ANOVA formula/calculation
Telemedicine
- Use of information and communication technologies to transfer medical data;
- Types: Teleoperations, telemonitoring, teledagnosis, teleradiology, telecardiology;
- Models: Direct, intermediate
- Benefits, barriers, needed requirements
Biomolecules
- Primary sequence representation of biomolecules
Molecular Descriptors
- Representation of molecular properties into numbers
- Descriptor dimensionality (1D, 2D, 3D)
Lipophilicity
- Importance in medicinal chemistry
- Methods for measuring lipophilicity
Pharmacophore concept
- A collection of steric and electronic features
- Drug-receptor interactions
QSAR
- Quantitative structure-activity relationship analysis to predict drug activity
QSAR - Data Analysis Techniques
- Approaches for analyzing data (artificial intelligence, neural networks, Hebb rule, Kohonen neural network)
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Test your knowledge of the drug discovery process with this quiz. It covers various aspects such as drug design techniques, docking processes, and statistical analysis related to sample size. Understand key terminologies like SAR, ADMET, and pharmacophore identification through this engaging quiz.