Question 1

What is the role of information technology in determination of protein properties?

Accepted Answer

Information technology makes it possible to store, analyze, and interpret large volumes of protein data. Using databases, algorithms, and computational models, software tools can predict the structural and functional properties of a protein from its raw sequence, simulate protein folding, and analyze interactions with other molecules. This allows faster protein characterization than experimental work alone, which matters for biological research and drug development.

Question 2

What type of protein raw data is used for computationally extracting information about the protein?

Accepted Answer

The primary raw data used for computational extraction of protein information is the amino acid sequence of the protein. This sequence data serves as the basis for predicting secondary and tertiary structures, functional domains, and other biochemical properties using bioinformatics tools. — Protein sequences provide the fundamental information needed for computational analysis, as the sequence determines the protein's structure and function.

Question 3

Name any two common tools for domain prediction.

Accepted Answer

Two common tools for protein domain prediction are Pfam and SMART. Pfam is a database of protein families that includes their annotations and multiple sequence alignments, while SMART is used for the identification and annotation of genetically mobile domains and the analysis of domain architectures. — These tools analyze protein sequences to identify conserved domains, which help in understanding protein function and evolutionary relationships.

Question 4

What is the significance of cheminformatics?

Accepted Answer

Cheminformatics is significant because it applies computational techniques to solve chemical problems, especially in drug discovery and development. It helps in managing chemical data, predicting molecular properties, virtual screening of compounds, and designing new molecules with desired biological activities. This accelerates research and reduces the cost and time involved in experimental procedures. — By integrating chemistry with information technology, cheminformatics enables efficient analysis and visualization of chemical data, facilitating better decision-making in pharmaceutical and chemical research.

Question 5

Which of the following is not a rule in Lipinski's rule of five (RO5)?

(a) No more than 10 hydrogen bond acceptors
(b) Partition coefficient $\log P$ of less than 5
(c) Not more than 5 hydrogen bond donors
(d) Molecular weight above $500\ \mathrm{g/mol}$

Accepted Answer

The correct answer is (d) Molecular weight above 500 g/mol. Lipinski's rule of five, a set of guidelines for evaluating drug-likeness and likely oral bioavailability, requires a molecular weight below 500 g/mol, not above it. The other three options are genuine criteria. The full set is:
- Molecular weight < 500 g/mol
- Log P < 5
- No more than 5 hydrogen bond donors
- No more than 10 hydrogen bond acceptors
Option (d) contradicts the molecular weight criterion.
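
The four criteria listed above can be checked mechanically once the descriptors of a compound are known. The following is a minimal Python sketch, not a reference implementation: the `Descriptors` class, the `lipinski_violations` function, and the example values are hypothetical names and numbers introduced for illustration, and the descriptors are assumed to have been computed elsewhere. The thresholds follow the wording used in this answer (molecular weight and log P strictly below their limits).

```python
from dataclasses import dataclass


@dataclass
class Descriptors:
    """Pre-computed descriptors of one compound (hypothetical container)."""
    molecular_weight: float  # g/mol
    log_p: float             # octanol-water partition coefficient
    h_bond_donors: int
    h_bond_acceptors: int


def lipinski_violations(d: Descriptors) -> list:
    """Return a description of every rule-of-five criterion the compound breaks."""
    violations = []
    if d.molecular_weight >= 500:
        violations.append("molecular weight is not below 500 g/mol")
    if d.log_p >= 5:
        violations.append("log P is not below 5")
    if d.h_bond_donors > 5:
        violations.append("more than 5 hydrogen bond donors")
    if d.h_bond_acceptors > 10:
        violations.append("more than 10 hydrogen bond acceptors")
    return violations


# Made-up descriptor values, for illustration only.
candidate = Descriptors(molecular_weight=612.0, log_p=5.6,
                        h_bond_donors=2, h_bond_acceptors=6)
for problem in lipinski_violations(candidate):
    print(problem)
```

An empty list means the compound satisfies all four criteria as stated here.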

Question 6

Which of the following properties of protein is not included in primary structure prediction?

(a) Aliphatic index
(b) Fold prediction
(c) Instability index
(d) Isoelectric point

Accepted Answer

The correct answer is (b) Fold prediction. Primary structure refers to the linear sequence of amino acids, and primary structure prediction covers properties computed directly from that sequence, such as the aliphatic index, instability index, and isoelectric point. Fold prediction concerns the three-dimensional arrangement of the chain, so it belongs to secondary or tertiary structure prediction rather than primary structure prediction.

Question 7

What is protein informatics and how does it help in understanding hypothetical proteins?

Accepted Answer

Protein informatics is the collection and analysis of protein information using information technology tools. It helps in understanding hypothetical proteins by predicting their functional sites, biochemical functions, and tertiary structures when conventional methods fail. — Protein informatics involves using computational tools to gather and analyze data about proteins. This approach is especially useful for hypothetical proteins whose functions are unknown, as it helps determine their structure and function using bioinformatics techniques.

Question 8

Which of the following is NOT a type of protein data used in protein informatics?

Accepted Answer

DNA methylation pattern data. Protein data types include crystal structures (PDB), sequences obtained by MALDI, and NMR data. DNA methylation patterns are epigenetic data and do not relate directly to protein informatics.

Question 9

Explain the role of microscopic images of heat-denatured protein aggregates in protein informatics.

Accepted Answer

Microscopic images of heat-denatured protein aggregates are used to analyze multi-fractal properties which help in designing protein markers for detection and study. — Heat-denatured protein aggregates show complex fractal patterns. Analyzing these patterns helps in designing markers that can identify specific proteins or aggregates, aiding in diagnostics and research.

Question 10

Which of the following databases is commonly used to obtain raw protein data for protein informatics analysis?

Accepted Answer

NCBI. It provides extensive raw protein data, including sequences and structures, which are essential for protein informatics. CAS and ChemSpider are chemical databases, and PharmGKB is a pharmacogenomics resource.

Question 11

Match the following protein data types with their typical use in protein informatics analysis.

Accepted Answer

Matching protein data types with their uses shows how diverse data contribute to protein informatics.

Question 12

Identify the two basic facilities required to carry out protein informatics analysis.

Accepted Answer

The two basic facilities required are (i) raw protein data from databases such as NCBI, PDB, and ChEMBL, and (ii) informatics tools and techniques such as image analysis, sequence similarity, structure optimization, and machine learning methods. The databases supply the raw sequences and structures, while the tools analyze them and predict protein properties.

Question 13

What is the significance of isoelectric point (pI) in protein primary structure prediction?

Accepted Answer

The isoelectric point (pI) is the pH at which a protein carries zero net charge; at this pH proteins are described as stable and compact. Knowing the pI helps in designing buffer systems for protein purification by isoelectric focusing and in predicting how the protein behaves at different pH values, which is important for purification and stability studies.
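
Because the pI is the pH of zero net charge, it can be estimated from the sequence by summing the charges of the ionizable groups and searching for the pH where the sum crosses zero. The sketch below does this with the Henderson-Hasselbalch relation and a bisection search. It is a simplified illustration under stated assumptions: the pKa values are approximate, illustrative numbers (prediction tools use differing sets, so results will not match any particular tool exactly), and the function names are hypothetical.

```python
# Approximate, illustrative pKa values (an assumption, not a standard set).
POSITIVE_PKA = {"K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}
N_TERM_PKA = 9.0
C_TERM_PKA = 2.0


def net_charge(sequence: str, ph: float) -> float:
    """Net charge of a peptide at a given pH (Henderson-Hasselbalch)."""
    # Free amino terminus (positive) and free carboxyl terminus (negative).
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERM_PKA))
    charge -= 1.0 / (1.0 + 10 ** (C_TERM_PKA - ph))
    for residue in sequence.upper():
        if residue in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[residue]))
        elif residue in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE_PKA[residue] - ph))
    return charge


def isoelectric_point(sequence: str, tolerance: float = 1e-4) -> float:
    """Find the pH of zero net charge by bisection between pH 0 and 14."""
    low, high = 0.0, 14.0
    while high - low > tolerance:
        mid = (low + high) / 2.0
        # Net charge falls as pH rises, so a positive charge means pI is higher.
        if net_charge(sequence, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


print(round(isoelectric_point("ACDKRHEY"), 2))
```

Bisection works here because the net charge decreases monotonically as pH increases, so there is exactly one crossing point in the search range.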

Question 14

Which physico-chemical property of a protein indicates its thermal stability and is defined by the relative volume occupied by aliphatic side chains?

Accepted Answer

Aliphatic index. It measures the relative volume occupied by the aliphatic side chains (alanine, valine, isoleucine, and leucine) and correlates positively with the thermal stability of globular proteins.
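
As an illustration, the aliphatic index is commonly computed with Ikai's formula, AI = X(Ala) + 2.9 × X(Val) + 3.9 × (X(Ile) + X(Leu)), where each X is the mole percent of that residue in the sequence. The Python sketch below applies that formula; the function name `aliphatic_index` and the example sequence are chosen here for illustration.

```python
def aliphatic_index(sequence: str) -> float:
    """Aliphatic index from mole percentages of Ala, Val, Ile and Leu."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")

    def mole_percent(residue: str) -> float:
        return 100.0 * seq.count(residue) / len(seq)

    # Coefficients 2.9 and 3.9 weight Val and Ile/Leu side-chain volumes
    # relative to the Ala side chain.
    return (mole_percent("A")
            + 2.9 * mole_percent("V")
            + 3.9 * (mole_percent("I") + mole_percent("L")))


# Made-up example sequence, for illustration only.
print(aliphatic_index("MKVLAAGIVGLLALA"))
```

A sequence made only of alanine gives 100, and a higher proportion of valine, isoleucine, or leucine raises the value further.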

Question 15

A protein has an instability index of 45. What does this indicate about the protein's stability?

Accepted Answer

The protein is predicted to be unstable. Proteins with an instability index above 40 are predicted to be unstable in vitro, while values below 40 indicate stability.
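
The threshold rule can be written as a short Python sketch. The instability index itself is usually computed as II = (10 / L) × the sum of dipeptide instability weights over all consecutive residue pairs, where L is the sequence length. The published weight table is not reproduced here, so `instability_index` takes it as a caller-supplied argument. Both function names are hypothetical, treating missing dipeptides as weight 0 is an assumption, and so is classifying a value of exactly 40 as stable.

```python
def instability_index(sequence: str, dipeptide_weights: dict) -> float:
    """II = (10 / L) * sum of weights over consecutive dipeptides.

    dipeptide_weights maps two-letter strings such as "AK" to a weight and
    must be supplied by the caller; missing pairs count as 0 (an assumption).
    """
    seq = sequence.upper()
    if len(seq) < 2:
        raise ValueError("sequence needs at least two residues")
    total = sum(dipeptide_weights.get(seq[i:i + 2], 0.0)
                for i in range(len(seq) - 1))
    return 10.0 / len(seq) * total


def classify_stability(index: float, threshold: float = 40.0) -> str:
    """Apply the rule from the answer: above 40 is predicted unstable."""
    return "unstable" if index > threshold else "stable"


print(classify_stability(45))  # prints "unstable"
```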

Protein Informatics and Cheminformatics
