Live Session1: Integrating MultiomicsApproaches inProteomics Research&theChallenges of Data Analysis
Sintesi
TLDRThe session was a comprehensive exploration of the integration of multiomics data—genomics, proteomics, and metabolomics—targeted at understanding complex biological questions. It focused on new technological advancements in these fields, particularly in single-cell analysis. Technologies like single-cell proteomics and genomics, proteogenomics, and metabolomics were discussed, providing insights into their applications in drug discovery and clinical research. The session demonstrated various instruments and workflows, emphasizing the progression from traditional proteomics to more advanced techniques such as single-cell proteomics and analysis of surface proteins which are crucial in drug targeting. Additionally, cutting-edge tools like mass spectrometers and data analysis software such as MS Fragger were showcased to underline the importance of accurate data interpretation in the burgeoning field of multiomics. Overall, this live session highlighted the critical role of integrating multiple omics layers to improve our understanding and treatment of diseases, offering budding scientists a glimpse into career opportunities within this rapidly advancing field.
Punti di forza
- 🔬 Single-cell proteomics is now feasible with advanced workflows.
- 💡 Integrating multiomics data is vital for comprehensive biological insights.
- 🧬 Genomics, proteomics, and metabolomics together offer a holistic view of biological systems.
- 📊 Big data analytics is essential for handling and interpreting large multiomics datasets.
- 🧪 Surface proteomics focuses on druggable cell surface proteins.
- 🔗 Proteogenomics helps link genomic data with protein expression profiles.
- 🚀 Emerging fields include single-cell analysis and post-translational modification studies.
- 🔍 Data analysis tools like MS Fragger ease the handling of mass spectrometry data.
- 💼 Growing career opportunities in multiomics fields, particularly in data analysis.
- 📈 Advanced mass spectrometry techniques crucial for detailed proteomics analysis.
Linea temporale
- 00:00:00 - 00:05:00
The session starts with an introduction to the live demo, focusing on the integration of multi-omics data, specifically proteomics and genomics, to address biological questions. Single-cell proteomics is highlighted as an emerging field with advancements allowing a more comprehensive analysis at the single-cell level.
- 00:05:00 - 00:10:00
The speaker discusses cell surface proteomics and its significance for drug target discovery by focusing on cell surface proteins. This includes biological developments in understanding post-translational modifications like phosphorylation and glycosylation, emphasizing the importance of instruments like Orbitrap for high-resolution analysis.
- 00:10:00 - 00:15:00
A typical workflow for quantitative proteomics using biological samples is explained. The focus is on mass spectrometry for protein analysis, discussing sample preparation, ionization, and data acquisition. High-resolution mass spectrometry facilitates exploring proteomics in depth.
- 00:15:00 - 00:20:00
The session shifts to discussing Tandem Mass Tag (TMT) for proteomics, illustrating its use for sample multiplexing for quantitative analysis. This involves tagging peptides, running samples on mass spectrometry, and analyzing the data based on intensity for quantitative results.
- 00:20:00 - 00:25:00
The demonstration describes optimizing TMT methods to improve data accuracy. The process involves parameter settings in mass spectrometry to ensure reliable data acquisition and analysis, highlighting the importance of adequate experimental settings for effective proteomics.
- 00:25:00 - 00:30:00
MRM and PRM workflows for targeted proteomics are discussed, focusing on the validation of proteins using triple quadrupole instruments. These methods help confirm quantitative findings from earlier analyses, providing robust validation techniques.
- 00:30:00 - 00:35:00
Highlighting integration with metabolomics, the session discusses the analysis of metabolites using mass spectrometry, illustrating the potential of combining proteomic and metabolomic data from the same sample for a more comprehensive biological insight.
- 00:35:00 - 00:40:00
The use of next-generation sequencing in genomics is introduced for comprehensive genetic analysis using the Ion Torrent S5. The discussion includes steps for preparing libraries and amplification, enhancing genomic data integration with proteomic data for holistic insights.
- 00:40:00 - 00:45:00
A focus on various bioinformatics tools to analyze proteomics data is demonstrated, emphasizing the use of free and powerful tools like MS Fragger for analyzing complex mass spectrometry data efficiently, showcasing the analytical workflow from data acquisition to interpretation.
- 00:45:00 - 00:50:00
Discussing the integration of multiple data types (genomics, proteomics, and metabolomics), the speaker emphasizes the importance of multi-omics in understanding complex biological systems, with a drive towards precision medicine and system biology insights.
- 00:50:00 - 00:55:00
Demonstrations continue with insights into the statistical and computational challenges of handling large datasets in modern research environments, indicating cross-disciplinary efforts necessary for advancing biological research.
- 00:55:00 - 01:04:00
The session concludes by highlighting educational initiatives and career prospects in proteomics and multi-omics fields. The emphasis is on the growing demand for expertise in data analytics and the potential for integrating omics data to drive innovation and discovery in biomedicine.
Mappa mentale
Domande frequenti
What was the main theme of the live session?
The main theme was integrating multiomics data, such as genomics, proteomics, and metabolomics, to answer biological questions.
What technologies are being advanced in the field of single-cell analysis?
Single-cell genomics, mainly transcriptomics, is established. New proteomics workflows and sample preparation advancements now allow for single-cell proteome analysis.
What challenges exist in proteomics regarding low-abundance proteins?
Challenges include high-abundance proteins overshadowing low-abundance ones, making sample preparation and fractionation crucial to enriching low-abundance proteins.
How is the field of proteomics shifting focus regarding drug target discovery?
Proteomics is focusing more on cell surface proteins, which are potential drug targets, instead of global proteomes.
What tools were demonstrated in the session?
Tools like mass spectrometers, software for mass spectrometry data analysis like MS Fragger, and NGS instruments were demonstrated.
What are some emerging fields within proteomics?
Emerging fields include single-cell proteomics, surface proteomics, and analysis of post-translational modifications (PTMs) like phosphorylation and glycosylation.
Why is big data analysis important in the context of multiomics?
Big data analysis is crucial for handling and integrating large datasets from genomics, proteomics, and metabolomics to build comprehensive models of biological systems.
What are the potential career opportunities in the field of multiomics?
There are many opportunities, especially in data analysis, given the significance and complexity of handling multiomics datasets.
What were some of the technological highlights presented?
Technologies such as high-resolution mass spectrometers, new workflows for protein and metabolite enrichment, and proteogenomics approaches were highlighted.
Visualizza altre sintesi video
How to Properly Use Margin with Interactive Brokers
The Alt-Right Playbook: How to Radicalize a Normie
How to Learn to Draw Faster, With Better Results!
INFPs: How To Be Productive Without Selling Your Soul
The art of innovation | Guy Kawasaki | TEDxBerkeley
Divisible and Indivisible Obligations. Article 1223 to 1225. Obligations and Contracts. Civil Code.
- 00:00:15sir you're live now you can
- 00:00:23start all right good morning everybody
- 00:00:25uh welcome to today's live session uh
- 00:00:30today we are going to plan to give you
- 00:00:32some demonstrations from the lab uh
- 00:00:35along with interacting with you life so
- 00:00:38this could be your opportunity to talk
- 00:00:39to us and ask questions but while you
- 00:00:43are uh uh going to do that we are uh
- 00:00:46going to show you some experiment in a
- 00:00:48more thematic way uh today's theme is
- 00:00:51going to be on integrating multix data
- 00:00:54set to understand the uh different
- 00:00:56biological questions and in this
- 00:00:59particular session we are going to show
- 00:01:00you some live experiments from different
- 00:01:03type of proteomics and genomic
- 00:01:05Technologies but then more Focus will be
- 00:01:07how to do the data analysis and
- 00:01:09integrate data from proteomics and
- 00:01:12genomics so uh rather than making it
- 00:01:15very theoretical uh I thought to keep it
- 00:01:18very minimal in terms of talking to you
- 00:01:19on the slides but we do more handson in
- 00:01:22the lab uh but I'll give you very quick
- 00:01:25flavor of you know where proix has
- 00:01:27started making more impact understand
- 00:01:31biology and addressing many clinical
- 00:01:33questions right so something which is
- 00:01:36coming up very new is about single cell
- 00:01:38p and I thought to take this as the
- 00:01:40first topic because one of the students
- 00:01:42sh she asked a question about how the
- 00:01:45field of single cell genomics and promic
- 00:01:47is emerging are going to really make
- 00:01:50impact so in this SL uh I'd like to say
- 00:01:53that single cell uh genomics mainly
- 00:01:55single cell transcriptomics is already
- 00:01:58very established and successful approach
- 00:02:01and people realize that you know they
- 00:02:02can do sufficient RNA and do the transic
- 00:02:05analysis with RNA seek but the challenge
- 00:02:07was to look at from Singles at how to do
- 00:02:10the entire proteum analysis but now with
- 00:02:13the uh advancements of the new type of
- 00:02:16mpic workflows and the new sample prep
- 00:02:19it has really now started becoming a
- 00:02:21reality that one could even look at the
- 00:02:23proteum from single cell right so what I
- 00:02:25showed you here on the slide think about
- 00:02:27if we have to work on the cancer you
- 00:02:30know to look for The Unique proteins
- 00:02:33linked to given cancer uh what we do in
- 00:02:36our own research which is a reality and
- 00:02:38and others also do we take the tumor
- 00:02:41tissue and that tumor tissue is a lump
- 00:02:43of the mass of many type of cell right
- 00:02:46we homogenize them Li them and in a in a
- 00:02:49hope that we look at only cancer rated
- 00:02:52protein but actually not no that's not
- 00:02:54true because we are mixing a variety of
- 00:02:57cell type in this example if you see
- 00:02:59neutrophil macrofagos blood vessels t-
- 00:03:02cell fibroblast all of them along with
- 00:03:04the cancer cell now if you were to
- 00:03:06disaggregate this tumor mass and look at
- 00:03:09you know separate components of them
- 00:03:12then only you know you can see you have
- 00:03:14very few cancer cell but there are lot
- 00:03:16of other type of abundant cells which
- 00:03:18are present there so in which way we can
- 00:03:21try to First Look at the cell sorting
- 00:03:24and look at the Single Cell of our
- 00:03:26interest which we are Keen to look at
- 00:03:28the proteum or transcrip
- 00:03:30to and based on that now can we do
- 00:03:33sufficient ways of sample prep and
- 00:03:36enrichment that we can get you know a
- 00:03:38lot of peptides from that given cell and
- 00:03:40one could run in the massp what you see
- 00:03:42in the bottom panel to generate those
- 00:03:44Mass Spectrum data and then look at the
- 00:03:46data analysis and come up with lot of
- 00:03:48you know different type of patterns so
- 00:03:50this was definitely not possible a few
- 00:03:52years back but now from last year and so
- 00:03:55there's been a lot more progress being
- 00:03:56made in this area that now we uh have
- 00:03:59have good ability to look at the uh from
- 00:04:02single cell even close to 5,000 proteins
- 00:04:053,500 to 5,000 protein is what people
- 00:04:08are able to now achieve with the newer
- 00:04:10uh advancements of the MPC like Tim sto
- 00:04:13and estal orbit trap uh one could really
- 00:04:16you know go to much deeper coverage with
- 00:04:18the Single Cell as well but also like
- 00:04:20sample becomes very crucial because you
- 00:04:22need to make much more automated
- 00:04:24processes so that you're you're not
- 00:04:26losing the much peptide and also you are
- 00:04:28looking at um how best to uh preserve
- 00:04:32the peptides which you lost in the
- 00:04:33multiple step so single po analysis
- 00:04:36Scopes method there are many type of you
- 00:04:37know new workflow which has emerged
- 00:04:39which I'm not going to talk as a lecture
- 00:04:41here but that just to give you flavor
- 00:04:43that single cell genomics and proteomics
- 00:04:46and soon single cell proteogenomics will
- 00:04:48be a reality which is almost you know we
- 00:04:50are making good advancement in this area
- 00:04:52so now just another exciting thing is
- 00:04:54happening in the proteum X right now to
- 00:04:56look at the cell surface
- 00:04:58proteum While most of the time we take
- 00:05:00the again the the entire cell and try to
- 00:05:02Li the cell and look at every protein
- 00:05:04inside that which is the global proteum
- 00:05:06but if you think about lot of drug
- 00:05:08targets they work only on the surface
- 00:05:10right so we should be focusing for the
- 00:05:13drug Target Discovery more of the
- 00:05:14membrane protein and the things are more
- 00:05:16receptors and present outside the
- 00:05:18membrane but when you combine everything
- 00:05:20you are actually diluting with the
- 00:05:22global abundant highly abundant protein
- 00:05:24to these low abundant and more unique
- 00:05:26targets so how to enrich only the
- 00:05:29specific cell surface protein is another
- 00:05:32emerging area in the proix which are
- 00:05:35couple of you know good successful
- 00:05:36examples shown here with also our
- 00:05:38collaborator at UCSF who are developing
- 00:05:41the new workflows to only enrich the
- 00:05:42cell surface protein and again with the
- 00:05:45these Advanced m is possible to
- 00:05:47analyze thousands of protein from these
- 00:05:50type of unique cell surface protein but
- 00:05:52again biologically is are very important
- 00:05:55development which are happening in the
- 00:05:56field of proteomic right but if you
- 00:05:59think about the understanding the
- 00:06:01biology and looking at diseases It's
- 00:06:03always important to look at the post
- 00:06:05transation modification again it was
- 00:06:07very difficult to look at various type
- 00:06:09of PTM analysis because if you think
- 00:06:12about the from the entire Global protein
- 00:06:15what percentage gets phosphorated even
- 00:06:17less than 1% so how to enrich those
- 00:06:20phosphopeptide and then try to analyze
- 00:06:23the proteum from that type of know less
- 00:06:26abundant uh samples has been challenged
- 00:06:29so example enrichment using different
- 00:06:31type of iMac columns and different type
- 00:06:33of tianium dioxide based Technologies to
- 00:06:35enrich the phospho peptides followed by
- 00:06:39running them in the mass spectometer
- 00:06:40with very sensitive instruments and
- 00:06:43acquiring all type of phosphor related
- 00:06:45proteins or the phosphor map of the
- 00:06:47proteum is another thing which is
- 00:06:49emerging very heavily and not only like
- 00:06:51you know phospho for that matter even
- 00:06:53glyos are very important and very
- 00:06:55complex and very
- 00:06:56challenging and then think about biology
- 00:06:58for UB and acation and variety of type
- 00:07:02of degradation and many type of other
- 00:07:04type of modifications are also equally
- 00:07:06important to understand the biology and
- 00:07:07disease so a lot of exciting things are
- 00:07:10happening in this area of course in your
- 00:07:12theoretical course of proteomics and
- 00:07:14proteogenomics we did not have time to
- 00:07:16touch upon all the latest advancements
- 00:07:18but these are you know kind of some of
- 00:07:20the quicker updates and again as I said
- 00:07:22was not possible without the some of the
- 00:07:24very latest high resolution mpect like
- 00:07:27from the orbit trap Asal orbit trap
- 00:07:29happen with the Tim stop mtic without
- 00:07:31them it was not possible to go to that
- 00:07:33level of depth of these very important
- 00:07:36class of proteins of membrane proteins
- 00:07:38and the ptms or looking at even the
- 00:07:40Single Cell proteum but these are
- 00:07:42started getting more and more reality
- 00:07:43which actually adds lot of excitement
- 00:07:45for all of you who are learning these
- 00:07:47new areas and want to venture into this
- 00:07:49new field by taking up the exciting
- 00:07:51research problems in the future so uh
- 00:07:55while this are some latest development
- 00:07:57but what is the typical workflow of
- 00:07:59doing protomet for all of these whatever
- 00:08:01I spoke or in general if you want to do
- 00:08:03qu quantity protomet a typical workflow
- 00:08:05involves that you take some biological
- 00:08:07sample it could be a cell line a
- 00:08:10biological tissue sample or various type
- 00:08:12of body fluid it could be saliva it
- 00:08:14could be serum it could be C spinal
- 00:08:15fluid many type of biological fluid
- 00:08:17samples right all of them you stract the
- 00:08:19protein you digest them with proteases
- 00:08:23like a tripin choten ly make them
- 00:08:26peptide form and then those peptides you
- 00:08:28are ionizing them with the
- 00:08:31electronization
- 00:08:32ESI after ionization they are going to
- 00:08:35go inside the mass spectrometer which
- 00:08:37could be vary of MPS and then you are
- 00:08:39acquiring their M byz and then you are
- 00:08:42generating their MSS Spectra or msms
- 00:08:44Spectra and going to analyze that data
- 00:08:47typical pic workflow whatever type of
- 00:08:49thing I'm talking they will fall in in
- 00:08:52this kind of domain only thing which
- 00:08:54becomes very critical is the sample
- 00:08:55preparation part which we already spoke
- 00:08:58in the last live session about different
- 00:08:59type of sample prep how to make it more
- 00:09:01unique and and and en that kind of
- 00:09:03sample right but eventually as you go
- 00:09:06along there are different type of
- 00:09:07workflow being followed to look into the
- 00:09:11quantitative type of workflows or
- 00:09:12looking at a specific modification and
- 00:09:14accordingly you to fine tune your
- 00:09:16workflows for the data acquisition at
- 00:09:18the mpect level and data analysis level
- 00:09:21so uh we have this uh high resolution
- 00:09:23Mass spectrometry facility at it Bombay
- 00:09:26which we have established as additional
- 00:09:27facility which is equipped with iy of
- 00:09:30instruments uh and what we thought we
- 00:09:32are going to show you some of these
- 00:09:33instruments how they can be utilized for
- 00:09:36different type of workflow so for
- 00:09:37example here you
- 00:09:38see orbit Fusion which is really
- 00:09:41utilized for the D proteum type of
- 00:09:43analysis or defo protom analysis utive
- 00:09:47along with the HC configuration is
- 00:09:50heavily utilized for metabolomics and
- 00:09:52triple qurol instrument is used for the
- 00:09:54targeted toomic or metabolomic based
- 00:09:57analysis so we will you know take more
- 00:10:00time in the lab and try to walk you
- 00:10:01through with these type of workflows but
- 00:10:04to begin with uh we will first talk to
- 00:10:07you about one of the quantitive plumix
- 00:10:09based approach which is still pretty
- 00:10:11powerful which is known as the TMT or
- 00:10:14tendem mass Char so idea is can you uh
- 00:10:18you have three control sample and three
- 00:10:20treatment samples you want to do
- 00:10:22quantitative analysis quantity
- 00:10:24proteom you digest the protein and after
- 00:10:27that you are after digestion you have
- 00:10:29these uh
- 00:10:30peptides uh and those n terminal
- 00:10:33peptides being labeled with these
- 00:10:35isobaric tag which is having different
- 00:10:38type of reporter ions for example tandom
- 00:10:40Mass tag reporter 126 till TMT 131 these
- 00:10:44are all reporter ions these are isobaric
- 00:10:47although reporter are different but then
- 00:10:48you add the carbonal group to make them
- 00:10:50all same mass and then you are mixing
- 00:10:52them with each of your six condition so
- 00:10:55now every condition of three control and
- 00:10:57three treatment they all got unique
- 00:10:59barcoded right and now you can mix them
- 00:11:02and after mixing them you run in the MPC
- 00:11:05and then you can when you look at msms
- 00:11:07level you can now segregate these
- 00:11:09reporter I separately and use this
- 00:11:11information of intensity to quantify the
- 00:11:14signal this is one of the very powerful
- 00:11:17way of doing the quantity protox other
- 00:11:20way of doing quantity protx could be
- 00:11:22label free quantification where every
- 00:11:24sample you're not labeling but you are
- 00:11:26going to run them individually in the MP
- 00:11:29and then you are acquiring the data
- 00:11:31which is more like you're building a
- 00:11:32library of every patient is specific or
- 00:11:34sample specific and then you are
- 00:11:36counting the number of Spectra or the
- 00:11:39relative intensity of all these abundant
- 00:11:42peptide and trying to compare those
- 00:11:43across your control and disease
- 00:11:45condition to then come up with quantive
- 00:11:47signal how much F change you can see
- 00:11:49from control and treatment so two
- 00:11:51different ways of doing analysis both of
- 00:11:54these are coming as a part of data
- 00:11:56dependent acquisition there's also new
- 00:11:58emerging field known as Dia data
- 00:12:00independent acquisition which I'm not
- 00:12:01talking right now to keep it simple but
- 00:12:04we will just you know walk you through
- 00:12:05about TMT workflow and how to acquire
- 00:12:08data uh in mpek uh in a live session
- 00:12:11right now which will be done with uh Dr
- 00:12:14ARA benergy he's going to show you the
- 00:12:16workflow and talk to you about the data
- 00:12:19how to acquire and analyze the data so
- 00:12:20ARA please talk about TMT now so ad can
- 00:12:24you help me to share the screen
- 00:12:48okay so uh let me this okay so welcome
- 00:12:52everyone uh welcome to the introduction
- 00:12:55to the proteomic course and uh TMT based
- 00:12:58proix so first of all like I will be
- 00:13:00giving like what are the advantages of
- 00:13:03sample multiplexing with the isic mass
- 00:13:05tax though Professor has explained more
- 00:13:08of it like how and where it is useful in
- 00:13:12the F
- 00:13:13of how and where it is useful in the
- 00:13:15field of proteomics it will give you
- 00:13:17increase through through uh when you are
- 00:13:20using isobaric tags due to the reporters
- 00:13:23and the I and
- 00:13:28just yeah
- 00:13:29uh when we are ISO using isobaric tax
- 00:13:32where there is a very few chance like
- 00:13:34the missing values will be very few as
- 00:13:36compared to the level three
- 00:13:38quantification we have a vast range of
- 00:13:40sample flexibility whether it is from
- 00:13:42human or bacteria or plant any species
- 00:13:46of the samples can be originated and uh
- 00:13:49like it can help us in multiplexing
- 00:13:51means comparing more than two samples
- 00:13:53means you can ideally now they have
- 00:13:56taken out a range of 24 and 26 like one
- 00:13:59go you can compare like more than two
- 00:14:02samples at a time and multiple
- 00:14:03comparison will help you to improve the
- 00:14:05statistics now as the professor told
- 00:14:08there is a mass reporter there is a mass
- 00:14:10normalizer and there is a protein
- 00:14:12reactive group your peptide will be your
- 00:14:14peptide will be uh connected with the
- 00:14:17balancer and balancer is connected with
- 00:14:20the reporter this is the reporter what
- 00:14:21you want to
- 00:14:23see coming up to the workflow as you
- 00:14:26have gone through our earlier session
- 00:14:28where we we have shown you how to
- 00:14:30prepare the samples we generally prepare
- 00:14:32the samples we do a fraction we do a
- 00:14:34labeling of the TMT TX after the
- 00:14:37peptides have been uh generated and
- 00:14:39followed by a cleanup followed by a
- 00:14:41fractionation which can improve the
- 00:14:43coverage followed by the concentration
- 00:14:46data interpretation data analysis can be
- 00:14:48done with the free software such as Max
- 00:14:50and flf as also as the proteum discovery
- 00:14:54which is one of the proprietary software
- 00:14:56now when you are uh delet dealing with
- 00:14:59the optimization of the TMT metods two
- 00:15:02things you have to keep in mind if you
- 00:15:04are using shorter gradients and shorter
- 00:15:06columns that can increase a CO isolation
- 00:15:09but uh if you are using 2 hours gradient
- 00:15:12with a 50 cm column either you have to
- 00:15:14increase the length of the column or the
- 00:15:16length of the gradient a 15 cm column 1
- 00:15:18hour to 1.5 hour of gradient is good
- 00:15:21enough but if you are using 50 cm column
- 00:15:24you can also load 500 nog to 1 microgram
- 00:15:27of load which can be go to throughout
- 00:15:29so this is how the gr establishment can
- 00:15:31be done and as you know like this is a c
- 00:15:35column so more of the hydrophobic
- 00:15:36peptides will be there so this is the
- 00:15:38interface how the method Edition is
- 00:15:40being done so this is the method editor
- 00:15:42which I am going to show you if the time
- 00:15:45permits so here like in the peptide
- 00:15:49quantification you can eily go to the
- 00:15:51TMT ms3 and you can place it here you
- 00:15:54can just drag this PMT and you can place
- 00:15:56it here once that has been done you will
- 00:15:59given an option like how you want to
- 00:16:00analyze your ms1 data and MS2 data uh
- 00:16:04the ms1 can be done in the OT mode and
- 00:16:08because OT have a higher resolution and
- 00:16:10MS2 mode as this orbitary Fusion what
- 00:16:13Professor San showed in our facility is
- 00:16:15of uh like trid Mass spectrometer is one
- 00:16:19of the trid mass spectrometer it has
- 00:16:21three kinds of mass analyzer qu IR graft
- 00:16:24as well as the orbit the second mass
- 00:16:26analyzis is a choice whether if you are
- 00:16:28using a CD you can use OT mode like a
- 00:16:30orbit mode and if you are using C
- 00:16:33Collision induced dissociation you can
- 00:16:35use the iron craft as the mass analyzer
- 00:16:38and thirdly like in the DD MS2 like uh
- 00:16:41which is the main for use for the
- 00:16:42quantification you can use the OT so in
- 00:16:46one like you can go to the 12 lakh
- 00:16:48Delton like the 2 lakh Delton of the
- 00:16:50Orit resolution but you can go in mass
- 00:16:54in TMT Mass tax you can go to a range of
- 00:16:561 lakh 20,000
- 00:16:59and severely like a multiple charge
- 00:17:01State the charge State can be concluded
- 00:17:03include from 2 to six because this is
- 00:17:06one of the Rob which separate from the
- 00:17:08mul because at a single go you can
- 00:17:11monitor more than two chares monotropic
- 00:17:15prepers keep it to simple pepti a full
- 00:17:18msms resolution with Char Dynamic
- 00:17:20exclusion and I res so these are the the
- 00:17:23settings we are good to go and consider
- 00:17:26selecting the most intense ion at a top
- 00:17:29speed
- 00:17:31mode so yeah so now this is done uh when
- 00:17:35you come to the sensitivity and
- 00:17:36specificity the isolation window is
- 00:17:38generally from uh 7 to 2 m the choice
- 00:17:42can be between 5 to 10 and for best
- 00:17:45accuracy top 10 nches of the
- 00:17:47prefractionated samples were taken so
- 00:17:50this is how a ideal parameter or a
- 00:17:54ideal Matrix of anmt workflow line so
- 00:17:57this is a 2hour gradient
- 00:17:59that is 120 Minutes you generally start
- 00:18:01with less hydrophobic means more of the
- 00:18:03Aquas to high hydrophobic which is an
- 00:18:06organic solvent which is ACN and you can
- 00:18:09easily obtain more than 2,000 proteins
- 00:18:11in a single shot and 10,000 to 11,000
- 00:18:15peptide zes in single and that can be
- 00:18:18Quantified with the help of cium
- 00:18:21discover okay so now let me uh let me
- 00:18:24take you to the
- 00:18:26original um mapping of the
- 00:18:30instrument
- 00:18:45just so this is the original interface
- 00:18:48of the video so that 120 hour minutes
- 00:18:50gradient you can see it here this is how
- 00:18:53a chromatogram looks like and if I can
- 00:18:56show you the total iron chromatogram
- 00:18:59uh view iron map okay you can see the
- 00:19:03Spectrum also this is how the Spectrum
- 00:19:05looks like most of the illusion has
- 00:19:07happened from 30 minutes to 110 minutes
- 00:19:09so this is our gradient like the first
- 00:19:11is the aqua space which I have shown you
- 00:19:145% of B then is the hydrophobic face and
- 00:19:17after that again Aquas so if you zoom in
- 00:19:20like
- 00:19:26sorry okay this is hanging
- 00:19:30is hanging a bit so if you zoom in each
- 00:19:33paks represent so this is how the Crux
- 00:19:35lights so you can see each Peak is a Ms
- 00:19:38and the void between the two is the MS
- 00:19:40Ms so this is how a peak is being seen
- 00:19:43so if you click it here let this also be
- 00:19:46there uh you say VI uh Spectrum okay
- 00:19:52great so like you can see so this is the
- 00:19:55MS so once you shift you can see if
- 00:19:58between two Ms if between two Ms if the
- 00:20:01fragmentation is of Ms Ms is more than
- 00:20:0410 because we have selected top 10
- 00:20:06abundant proteins if the transition is
- 00:20:08coming more than 10 it is a very good
- 00:20:10Spectrum at indeed it is a very good
- 00:20:13Spectrum so this is how you can monitor
- 00:20:16how the chromatogram looks
- 00:20:18like once this is done you can see your
- 00:20:22labeling efficiency uh TMT templex sence
- 00:20:26can vary from 126 to uh 132 whatever the
- 00:20:31maybe so you can simply go you can
- 00:20:33simply go here you can uh go on the
- 00:20:40ranges in the
- 00:20:43ranges just a minute let it come in the
- 00:20:46ranges you can select one more column
- 00:20:49where you can select Ms sorry we have to
- 00:20:52do it on the Spectrum so view
- 00:20:55chromatogram once the chromatogram comes
- 00:20:58you you can go on
- 00:21:03the I have already done this
- 00:21:07okay go to the
- 00:21:10ranges once you go to the
- 00:21:12ranges I will show you from
- 00:21:16okay so so this is how it looks like so
- 00:21:20this is a chromatogram view uh you go to
- 00:21:24the ranges and suppose my labeling my
- 00:21:28levels which are there are coming I will
- 00:21:31click on submerge with another graph and
- 00:21:34I will click on the mass range now it
- 00:21:38will give you the mass of the MS
- 00:21:40precursors which have been detected so
- 00:21:43as we know our the labels which are
- 00:21:47occurring in the uh DMT tags varies from
- 00:21:50126 to 130 1 for four place or
- 00:21:55132 whatever may be the ranges
- 00:21:59we will just click on the same so this
- 00:22:02is how you can take out the mass
- 00:22:04range just click on okay it will take
- 00:22:07some
- 00:22:15time so it is going to tabulate and see
- 00:22:19what all precursor which are present
- 00:22:21here are labeled with that kind of
- 00:22:25efficienc Tak a bit of time
- 00:22:32by the time it comes like
- 00:22:35uh uh we can show the
- 00:22:39instrument comes we can show the
- 00:22:45instrument all right so well these are
- 00:22:48the live run which AR gu is about trying
- 00:22:50to show you from
- 00:22:52the uh orbit Fusion which we are
- 00:22:54acquiring the data and again U to really
- 00:22:58get you know proper fragmentation here
- 00:23:00and looking at the uh reporter you need
- 00:23:04to optimize the Collision energy and to
- 00:23:06do lot optimization at the
- 00:23:08chromatography level and msms
- 00:23:10parameters um but eventually uh if your
- 00:23:14labeling was highly efficient you have
- 00:23:17done the proper labeling and if you are
- 00:23:19able to now uh uh look at the data you
- 00:23:23can acquire lot of good quantitative
- 00:23:25accuracy uh what you see now also on the
- 00:23:29um directly from the uh interface let's
- 00:23:34um so AR if you can just unshare your
- 00:23:36screen then we can have you on the full
- 00:23:47screen see where the samples are
- 00:23:50ined already loaded and it is running so
- 00:23:53I cannot open the compartment box of the
- 00:23:55uh Nano LC the sample is eled from here
- 00:23:59and it comes to the or so this is the
- 00:24:03main easy spray source of fusion and you
- 00:24:06can see like this is the Trap column
- 00:24:08what we are having two types of columns
- 00:24:10are used before the sample is injected
- 00:24:12into the instrument one is the tree
- 00:24:14column or the Trap column if any debr
- 00:24:17which is coming from the direct nanc it
- 00:24:19will be trapped here and after that an
- 00:24:21analytical column 15 cm analytical
- 00:24:24column is used in which the samples is
- 00:24:26in
- 00:24:28sample injected so this is the ITC
- 00:24:31isothermal calibration tube once the
- 00:24:33sample goes inside it goes onto the uh
- 00:24:36instrument and after that you can see
- 00:24:38that kind of gradi so coming
- 00:24:43on all right so thank you ARA this is
- 00:24:47one of the workflow of doing the
- 00:24:49quantitive protomet and once you have
- 00:24:53identified let's say lot of protein
- 00:24:55which shows you significant protein lead
- 00:24:58uh which is a good hit for you so now
- 00:25:00you need to validate them right
- 00:25:02conventionally people used to validate
- 00:25:04the targets using antibody based
- 00:25:06approach like Eliza or Western broad but
- 00:25:09for every protein uh we don't have uh so
- 00:25:12nice antibodies available U for many it
- 00:25:16is only polyon available so then how to
- 00:25:19validate the target so since we are
- 00:25:21acquiring data from masp which is more
- 00:25:24about the peptide sequences and their
- 00:25:26quantification idea came why not we only
- 00:25:29validate the those peptides from
- 00:25:31different type of mpect so we are let's
- 00:25:33say discovering an orbit Fusion but can
- 00:25:35we validate the targets of only specific
- 00:25:38peptides of the proteins of Interest
- 00:25:40using the triple quadrapole based
- 00:25:42instrument or using another type of
- 00:25:43orbit trap instrument so the two
- 00:25:45different workflow which are currently
- 00:25:46being heavily used one is multiple
- 00:25:49reaction monitoring other is a par
- 00:25:51reaction
- 00:25:52monitoring multiple monitoring is one of
- 00:25:55the uh triple Quadra pole based
- 00:25:57configuration when you are having the
- 00:26:00two set of quadr q1 and Q3 in between
- 00:26:03you have Collision cell and you already
- 00:26:05know that which of the specific protein
- 00:26:08let's say five proteins of that 15
- 00:26:10peptides of your interest you have told
- 00:26:13the instrument with a hardware with a
- 00:26:15spe specific software on the skyline
- 00:26:17that you're only going to look at the
- 00:26:18transitions of these 15 peptides and
- 00:26:22only those the q1 is actually acquiring
- 00:26:24data and doing fragmentation to generate
- 00:26:26the relative iion uh spectrum of
- 00:26:29PRM is much more powerful it can
- 00:26:31actually look at all of
- 00:26:32those precursors and fragment ties
- 00:26:35simultaneously so then one could go much
- 00:26:38higher number of transitions as compared
- 00:26:39to the mrm based approach so in which
- 00:26:43way these experiments can be done in the
- 00:26:44lab let's have another live
- 00:26:47demonstration from ainash who will walk
- 00:26:49you through these workflows quickly
- 00:26:51directly on the instrument so ainash
- 00:26:54please show the mrm based workflow hello
- 00:26:57Al so as you can see already discussed
- 00:27:00about the targeted programing so this is
- 00:27:01the T volage from the th scientific and
- 00:27:05this has the basically the triple Quadra
- 00:27:07so are the two marer and along with that
- 00:27:10this is the L system connected so this
- 00:27:12is the system and every LC system
- 00:27:15basically multiple compartment this is
- 00:27:18basically Auto sampler where we keep our
- 00:27:20samples like this kind of files and so
- 00:27:23these are the different kind of tray
- 00:27:25where we keep our samples and from these
- 00:27:27samples they go to the like like this is
- 00:27:30basically the pump compartment and this
- 00:27:32one is the and yeah so you can see these
- 00:27:35are the pump compartment where all the
- 00:27:37solvents like have been mixed and get
- 00:27:40eluted during according to the our what
- 00:27:43we said and this is the column
- 00:27:46compartment where we keep our
- 00:27:47compartment like keep our column and it
- 00:27:50maintains some internal temperature for
- 00:27:52the columns and these are the solventes
- 00:27:55from this solvent the different gradient
- 00:27:57has been prepared
- 00:27:58so as you can see in the monitor so this
- 00:28:00is
- 00:28:01basically liquid system application so
- 00:28:06this is the pump module where we can set
- 00:28:08our gradient and this one is a sample
- 00:28:10and we can set like set like where is
- 00:28:13our sample is present and this is the
- 00:28:14column compartment and we can set the
- 00:28:16temperature and uh so all the method
- 00:28:19file which has been used for the
- 00:28:21targeted pics are set in the UN
- 00:28:23application and uh and those method has
- 00:28:26been like set in the X Co caliber so as
- 00:28:27you can see this is a live run going on
- 00:28:29so this are the chromatogram and
- 00:28:31different kind of I and the
- 00:28:34gradi here so as you can see like it
- 00:28:38depend on the what chromat like what
- 00:28:40gradiate we set according to that they
- 00:28:41get uted and this kind of chatram come
- 00:28:44so if I can also show so this is the one
- 00:28:47of the sample R from the tumor sample so
- 00:28:50this is not the Gan distribution as we
- 00:28:52see the see in the global prot this is a
- 00:28:54quite like like not that dense
- 00:28:58because this is we are looking for the
- 00:29:00only the targeted proteins and peptides
- 00:29:03so that's why is not there so I can show
- 00:29:05you some of the result from so after
- 00:29:07that getting the raw files this kind of
- 00:29:09raw file we can import those raw file in
- 00:29:12the different another software which is
- 00:29:14called as a Skyline and we can look for
- 00:29:16the peak of the particular protein so
- 00:29:18this is one of the study in the from the
- 00:29:20co so as you can see so upper top top
- 00:29:22panel is basically the co positive and
- 00:29:25lower panel is basically the
- 00:29:28so we have identified the multiple
- 00:29:30protein which are basically shown the
- 00:29:32differential regulation in the different
- 00:29:34set of a group so as you can see the
- 00:29:36also the intensity of in the like
- 00:29:38negative sample and the positive sample
- 00:29:40so basically this instrument has been
- 00:29:43very good for the doing the targeted
- 00:29:44proteomics and from the what after
- 00:29:47analyzing theic global data set so yeah
- 00:29:51over
- 00:29:52to all right thank you Vash U so now
- 00:29:57let's say say you already have looked at
- 00:30:00you know your specific uh protein
- 00:30:02targets and you are able to now
- 00:30:04correlate that intensity what you have
- 00:30:06observe the four changes they are same
- 00:30:08whether you look for the discovery or
- 00:30:10not in the validation stage additionally
- 00:30:13from the since we're talking about mpek
- 00:30:16mpek is also very powerful lcmsms
- 00:30:18approach to look at the metabolites in a
- 00:30:21field on as metabolomics right the way
- 00:30:23we look at all the proteins and
- 00:30:24proteomics likewise masp is very
- 00:30:26powerful to look at all the metabolomics
- 00:30:29for all the metabolites although there
- 00:30:31other techniques as well like NMR can
- 00:30:33also be used gcms can also be used they
- 00:30:35give you very specialized proces of
- 00:30:37metabolite but a global broad survey can
- 00:30:40be done with the lcms based approach so
- 00:30:43since let's say you think about you have
- 00:30:45biological sample or tissue or cell
- 00:30:46liate from which you want to do
- 00:30:48proteomic you're going to extract the
- 00:30:50protein out why not you know you also
- 00:30:53extract the metabolite out on the same
- 00:30:54sample for example you know if you're
- 00:30:56doing methanol based PR reputation you
- 00:30:59can even use the same sample for even
- 00:31:00some part for the protein some part for
- 00:31:02the metabolite analysis right how
- 00:31:04powerful that could be if you are
- 00:31:06integrating the data from the same
- 00:31:07sample today's live session is also for
- 00:31:10the broader audience talking about
- 00:31:11proteogenomic and multiomic so now we
- 00:31:14want to expand your view think about
- 00:31:16from the same sample you are generating
- 00:31:18next layer of information at the
- 00:31:20metabolite level so metabolomics can be
- 00:31:22done using various type of mass
- 00:31:24spectometer one which we are going to
- 00:31:26show you the another orbit trap Q
- 00:31:28executive which is interfac with
- 00:31:30uhplc and it is pretty straightforward
- 00:31:33much simpler to do metabolomics as
- 00:31:35compared to the proteomic of course you
- 00:31:37need to optimize multiple parameter
- 00:31:40which now an is going to quickly
- 00:31:41demonstrate you metabolomics based
- 00:31:44workflow directly from the
- 00:31:49lab yeah hello everyone good morning so
- 00:31:52now we will be sh you about the running
- 00:31:55of the sample for the metabolite
- 00:31:57identification
- 00:31:58so as it was discussed that in the prot
- 00:32:01there is the nanc or the Nan flow of
- 00:32:03theow of the p and solution but here for
- 00:32:07the metabolite what we generally use is
- 00:32:09the HC here we uses the flate you can
- 00:32:12see it is in the range of ml for ml per
- 00:32:15minute so here is the same like in the
- 00:32:18alage what a has showed there's the same
- 00:32:20compartment of a and the column now
- 00:32:23metabolize can be divided into three or
- 00:32:27more classes on the basis of their
- 00:32:28hydrophobicity so there can be
- 00:32:30hydrophobic it can be in between
- 00:32:33amphilic it can be hydrophilic so on
- 00:32:34that basis of that the particular column
- 00:32:37is used now if you are want to out only
- 00:32:40the separate the identify hydrophobic so
- 00:32:43then a C8 column as like it is being
- 00:32:46used in the case of prot so then same
- 00:32:49reverse B chromatography is being
- 00:32:53principle is being followed where the in
- 00:32:55the column compartment the column is
- 00:32:57being added and in the solution you can
- 00:33:00see that it is being made uh the
- 00:33:03different layers different gradient of
- 00:33:05hydrophobicity is being generated like
- 00:33:07you can see here and on the basis of the
- 00:33:11hydrophobicity compounds are eluted out
- 00:33:13now after eluting out as it goes into
- 00:33:17the mass spectrometer and this Mass
- 00:33:19spectrometer is a high mass spectrometer
- 00:33:22where it is consist of the quadruple and
- 00:33:24the orbit trap so there in this two Mass
- 00:33:26analyzers first
- 00:33:28the precursor ion is getting selected at
- 00:33:31ms1 level and then finally at the after
- 00:33:34the fragmentation MS2 level is done so I
- 00:33:36just quickly show you some runs that are
- 00:33:40going on so you can see that these are
- 00:33:42the Spectra at the msms level so the
- 00:33:46metabolites now one thing that needs to
- 00:33:48be understand that metabolites can be
- 00:33:50identified at ms1 level with the
- 00:33:53compound molecular weight but same
- 00:33:54compound can have different molecular
- 00:33:56weight so for this we need to check the
- 00:34:00m to match with the known compound that
- 00:34:03particular compound will have a unique
- 00:34:05fragmentation picture so this is the Run
- 00:34:07going on with the chromatogram on the
- 00:34:09basis of the gradient the particular
- 00:34:11compounds are getting analized and here
- 00:34:14if you see the runs that has occurred so
- 00:34:17here you can see this is the particular
- 00:34:19compound that has eluted out now in the
- 00:34:20case of metabolic we also need to add
- 00:34:23some kind of internal standards and run
- 00:34:26QC pools to check that our instrument is
- 00:34:29not there's no technical variability in
- 00:34:32between our run so for example this is
- 00:34:33one of the standard which has the which
- 00:34:36should be eluting out at any specific
- 00:34:38time throughout your experiment with a
- 00:34:41certain intensity and a good Pi so this
- 00:34:44is in brief about the metabolomic
- 00:34:47experiment Global met yeah thank
- 00:34:52you all right thank you anit so again
- 00:34:56some of these experiments what you see U
- 00:34:59students who are live U it's not very
- 00:35:02difficult to actually perform these
- 00:35:04experiments right uh but what actually
- 00:35:06becomes difficult to really analyze this
- 00:35:08data because it takes a while to make
- 00:35:10sense of these data while it actually is
- 00:35:12very easy to generate these data but to
- 00:35:15really get the maximum output from um
- 00:35:19using these type of workflows of
- 00:35:21metabolomic or proteomic you generate
- 00:35:24huge amount of data and Analysis and
- 00:35:26workflow that is is kind of very
- 00:35:29challenging U I think know while we are
- 00:35:33going to talk more about other
- 00:35:34Technologies let me just take a question
- 00:35:36from a student about
- 00:35:38TMT uh how can we achieve High
- 00:35:41sensitivity in detecting low abundance
- 00:35:44protein using TMT based
- 00:35:45proteomics so uh in prot is always a
- 00:35:49challenge that you know
- 00:35:51whatever entire complex protein we want
- 00:35:53to analyze there are always some
- 00:35:55abundant protein which will be having
- 00:35:57more more peptides and they will always
- 00:35:59mask our uh you know
- 00:36:01maspic ionization followed by msms
- 00:36:04analysis so you always see high abundant
- 00:36:07protein much more as compared to low
- 00:36:08abundant protein so it's actually you
- 00:36:11know not the TMT or any quantitative
- 00:36:14strategy which can enrich lot of low
- 00:36:16burent protein but rather your sample
- 00:36:18prep is the first place when you can
- 00:36:20think about how to enrich your complex
- 00:36:23proteum uh many samples like for example
- 00:36:26you work on a plant sample it has the
- 00:36:27plant leaf as Risco protein very
- 00:36:29abundant protein you work on the saliva
- 00:36:32protein you work on the plasma protein
- 00:36:34all of the these samples like you know
- 00:36:36amas or having the
- 00:36:38Albin IGG all of these kind of abundant
- 00:36:41proteins you need to remove them to then
- 00:36:43try to enrich the getting the high
- 00:36:45abundant protein and to be out and low
- 00:36:48abundant to be seen in the m so sample
- 00:36:50prep is the first step which can be
- 00:36:52utilized then once you are let's say
- 00:36:55done the extraction and done the uh your
- 00:36:57proper ways of enrichment for lowan
- 00:37:00protein then next thing is can you
- 00:37:02fractionate those you know so that you
- 00:37:04have from the one complex uh tube which
- 00:37:07has all the 10,000 protein can you
- 00:37:09separate in 10 tubes right and and with
- 00:37:11different type of pH fraction other type
- 00:37:14of chromatographic method or isric Point
- 00:37:16method and fractionary do then each one
- 00:37:18of those when you are going to run in
- 00:37:20the MP you are going to see abundant
- 00:37:22proteins which are low even abundant
- 00:37:24will be enr now TMT will be D
- 00:37:28very powerful because TMT is very
- 00:37:30accurate for the quantification so if
- 00:37:33you have done these kind of pre- sample
- 00:37:35processing and the fractionation then
- 00:37:37TMT based approaches will be able to in
- 00:37:39like you know do more accurate
- 00:37:41quantification of the low abundant
- 00:37:43protein which you are not able to do
- 00:37:45earlier but your sample preparation
- 00:37:47strategy and fracturation strategy will
- 00:37:49make a very important role for giving
- 00:37:52you the broad
- 00:37:53coverage so uh we'll be happy to take
- 00:37:56more questions so please feel free to
- 00:37:58ask uh on the Forum if you have more
- 00:38:00query but let's just take you know kind
- 00:38:03of to broaden the scope we talking about
- 00:38:05multiomic I just talked about proteomic
- 00:38:07and metabolomics think about genomics
- 00:38:09why that is the most important like you
- 00:38:11know the blueprint right so can we even
- 00:38:13try to think about uh integrating the
- 00:38:16data from genome and prum right and that
- 00:38:18sometime especially for disease context
- 00:38:21can be very important uh genomic is very
- 00:38:23powerful for sure and very robust it can
- 00:38:26you know very quickly with the Next
- 00:38:27Generation sequencing can look at all
- 00:38:30possible you know your Gene sequences
- 00:38:32the various variant forms looking at
- 00:38:34mutations all type of things and if you
- 00:38:36were to look at from the same sample the
- 00:38:39protein and the same sample the
- 00:38:40metabolite you're building the layer by
- 00:38:42layer information now from The genome
- 00:38:45level you are able to see okay which are
- 00:38:46all possible mutation from that given
- 00:38:48cancer or given disease you are able to
- 00:38:51see and now using protein you are able
- 00:38:53to further enrich and say okay how many
- 00:38:56modifications I can see what is the
- 00:38:58correlation between these type of
- 00:39:00genotypic changes which we see and
- 00:39:02finally happening at the phenotypic
- 00:39:03level by integrating the functional
- 00:39:05information from the proteome and the
- 00:39:07metabolome so this is where I think
- 00:39:09integrating the piece of information is
- 00:39:11very important and you are taking this
- 00:39:13course on proteogenomics you have gone
- 00:39:15through it and there is you know very
- 00:39:17active effort with the proteogenomics
- 00:39:19community to look at the especially for
- 00:39:21the cancer program which wased by Joe
- 00:39:23Biden about cancer moonshot to look at
- 00:39:26integr the data of different cancer type
- 00:39:29and provide much more powerful
- 00:39:30information as a part of a big
- 00:39:32initiative known as the ICPC
- 00:39:34International cancer protogen Consortium
- 00:39:36where India is also not part of the
- 00:39:38cancer Moon short program so aish is
- 00:39:41going to show you briefly in the lab
- 00:39:43about one of the uh genome sequence and
- 00:39:46and some brief overview about it then we
- 00:39:49will shift to some of the Big Data
- 00:39:50analysis part
- 00:40:00hello everyone I Ai and I'll be giving
- 00:40:02you a demo on the uh NGS instrument that
- 00:40:05we have So currently we have iron
- 00:40:08torrent S5 instrument that you can see
- 00:40:10in your screen I'll give a brief
- 00:40:12overview on the technology and then we
- 00:40:14can talk about the instrumentation so if
- 00:40:16anit can focus on the screen yeah so uh
- 00:40:20I'm going to talk about the iron torrent
- 00:40:21S5 so a brief overview on how you can
- 00:40:24prepare your sample for the genomic so
- 00:40:26first step is to prepare the library and
- 00:40:29to prepare the library very very
- 00:40:31initially you should have a good genomic
- 00:40:33DNA or that or any kind of DNA that you
- 00:40:35want to do for the sequencing you need
- 00:40:38to have that DNA good quantity and you
- 00:40:40quantify that DNA followed by the
- 00:40:44amplification followed by the
- 00:40:46amplification once the amplification of
- 00:40:48the target is
- 00:40:50done uh you have to digest the amplicons
- 00:40:53digest the amplion such as that the
- 00:40:55sticky end yes
- 00:40:58uh I think probably your screen is
- 00:40:59shared you have to stop the sharing of
- 00:41:01your screen then we can see you in large
- 00:41:04because right now it's very small panel
- 00:41:05to really
- 00:41:08view
- 00:41:10okay from your system probably the Have
- 00:41:13you shared the screen something that you
- 00:41:15have to
- 00:41:17stop I have not shared
- 00:41:20anything and this Frame of anit has to
- 00:41:23come on the full screen because you are
- 00:41:25very
- 00:41:27a small manely
- 00:41:29scene
- 00:41:33okay I should come on the full screen
- 00:41:35right of your part then only it will be
- 00:41:38visible now I I think it's
- 00:41:41coming okay now you have to it's PPT is
- 00:41:44coming
- 00:41:45actually yeah so first I'll explain the
- 00:41:48pp and then you can give a demo of the
- 00:41:49instrument all
- 00:41:51right yeah so uh as I was saying the
- 00:41:55first step in the Next Generation
- 00:41:57sequencing is to get a good quantity of
- 00:41:59the DNA and once you have a good
- 00:42:01quantity of the DNA next you can amplify
- 00:42:04the targets there are multiple
- 00:42:05techniques by which uh targets can be
- 00:42:07Amplified one of the most common method
- 00:42:09is the PCR and once the targets are
- 00:42:12Amplified then the sticky ends are
- 00:42:14created as you can see here in the
- 00:42:16workflow the targets are Amplified
- 00:42:18you'll get multiple copies of your
- 00:42:20target DNA and the uh then the sticky
- 00:42:23ends can be created using mild detergent
- 00:42:25once those sticky ends are created
- 00:42:27then the adapter barcodes which are
- 00:42:30essential for the sequencing reaction
- 00:42:32will be added to those sticky ends and
- 00:42:34now these are your libraries which are
- 00:42:36prepared the next step is to amplify
- 00:42:40those Target libraries for that in our
- 00:42:43uh iron torrent S5 we have an instrument
- 00:42:45called iron OneTouch two instrument
- 00:42:47which works on the principle of imulsion
- 00:42:49PCR so uh this is how the instrument
- 00:42:53looks like and you have to fill the
- 00:42:54imulsion mixture here on this on this
- 00:42:58setup and once it's there the imulsion
- 00:43:00reaction will be started and the the the
- 00:43:03libraries that you have prepare during
- 00:43:05your first step of uh NGS will be
- 00:43:08Amplified followed by The Next Step
- 00:43:11would be to enrich those template
- 00:43:13specific uh Target libraries for that we
- 00:43:16have another instrument which is called
- 00:43:18OneTouch es this is how the instrument
- 00:43:20looks like so you will just enrich your
- 00:43:23template specific Target libraries in
- 00:43:25this step once the temp template
- 00:43:28specific Target libraries are enrich in
- 00:43:30this step you will remove those beads
- 00:43:31which are not having any kind of uh uh
- 00:43:34any kind of Library so those libraries
- 00:43:37those beads will be removed in the step
- 00:43:39and you will have proper um like proper
- 00:43:42template specific library and those
- 00:43:44libraries will be loaded onto this chip
- 00:43:47like this as as it is shown here the
- 00:43:50using one ml pip the libraries template
- 00:43:53specific libraries will be loaded on the
- 00:43:55chip I'll be giving a Dem of this chip
- 00:43:58and everything in in a few minutes and
- 00:44:01that's how you can once you have your
- 00:44:03template specific librar are loaded you
- 00:44:05can just load this chip and start the
- 00:44:08sequencing run talking about the
- 00:44:10principle how this iron to S5 works so
- 00:44:13it works on identifying the uh this
- 00:44:16proton ion which is released during the
- 00:44:18formation of phosphor diaster bond so we
- 00:44:20all have learned molecular biology where
- 00:44:22we have learned that during the
- 00:44:23formation of phosphor diester bond
- 00:44:25whenever one nucleotide is added to the
- 00:44:28um to the ongoing stand of DNA there is
- 00:44:31formation of phosphodiester bond which
- 00:44:32is something like this and there's a
- 00:44:34release of proton ion so this instrument
- 00:44:36basically detects this proton ion which
- 00:44:38is released during this step and it
- 00:44:40converts this chemical energy to the
- 00:44:42electrical energy and that's how so this
- 00:44:45chip has a lot of sensors here where
- 00:44:48this proton ion is detected by the by
- 00:44:51the sensor which is located below the
- 00:44:53below the chip and it converts the
- 00:44:55chemical signal to the electrical signal
- 00:44:58that's how the instrument gets to know
- 00:44:59that the sequencing is happening now if
- 00:45:02I'm can show can show the instrument
- 00:45:04I'll give a uh I'll give you the tour of
- 00:45:07the uh
- 00:45:23instrument right so if you can focus on
- 00:45:26so this is the iron torrent S5
- 00:45:28instrument where the sequencing happens
- 00:45:31so before uh to start the sequencing
- 00:45:33reaction you have to clean the
- 00:45:34instrument so I'm going to perform that
- 00:45:36step and I'll show you what all the
- 00:45:38agents and what all cartridges goes
- 00:45:40inside for your sequencing
- 00:45:50so it will take some like just few uh
- 00:45:54time so you can see here it will tell
- 00:45:56you that these are all of the cartridges
- 00:45:59and all of the washing Regents should
- 00:46:01you should have before starting the uh
- 00:46:03cleaning of the instrument and then
- 00:46:04subsequently the sequencing run I'll
- 00:46:06show you uh that's how the inside of the
- 00:46:09instrument looks like
- 00:46:53stopit so I'm not even there I have I
- 00:46:56have left the meetings from
- 00:47:17here all right so
- 00:47:20uh she is figuring out you know way to
- 00:47:23expand her screen uh what is important
- 00:47:26here to think about uh the different
- 00:47:29Technologies available whether for
- 00:47:31looking at your complex pum or genome or
- 00:47:34metabolome but can we really uh be
- 00:47:37creative to look at from the same sample
- 00:47:39and try to enrich that data right um and
- 00:47:42how to really look at the entire thing
- 00:47:44in more totality so now I think you can
- 00:47:46see the IR instrument and I should
- 00:47:48briefly please explain timing is as now
- 00:47:58ai go ahead can
- 00:47:59you okay so sorry for the glitch but now
- 00:48:03I hope you can see the instrument here
- 00:48:06so this is the iron torrent F5 setup
- 00:48:08that we have in our phics lab and before
- 00:48:11going for any sequencing back we have to
- 00:48:13first clean the instrument so I'm going
- 00:48:14to show you how to clean the instrument
- 00:48:16and what all Regents start both inside
- 00:48:19the sequencer so a you can focus on the
- 00:48:22instrument now that's how the instrument
- 00:48:24looks like and I can show you inside the
- 00:48:27instrument where we have all these
- 00:48:29cartridges and everything I'll just this
- 00:48:31chip clamp where we have this chip which
- 00:48:34is very important it some looks
- 00:48:36something like this and there are here
- 00:48:38are a lot of Bio sensors which I was
- 00:48:40talking about and you have to load your
- 00:48:42uh sequencing reaction onto here and
- 00:48:45once your sequencing reaction is loaded
- 00:48:48it will it should be equally spread
- 00:48:50along this chip and then we can gently
- 00:48:52place this chip um onto this chip
- 00:48:56cartridge
- 00:48:58like this it will go smoothly and you
- 00:49:00have to close this chip clamp other than
- 00:49:03chip this is the uh cartridge which is
- 00:49:07the nucleotide cartridge here your all
- 00:49:09the dntps are here and it will go like
- 00:49:12this here like this in the instrument
- 00:49:14this is the wash solution so every time
- 00:49:16once your nucleotide is go uh like uh
- 00:49:19goes to the goes to the instrument this
- 00:49:21is a was solution so every time after
- 00:49:24the one nucleotide the was solution is
- 00:49:26uh will clean the tubes and everything
- 00:49:28this is the cleaning solution which is
- 00:49:30going to be used during the cleaning and
- 00:49:33here we can also U for the um this is
- 00:49:38the waste collecting uh thing where all
- 00:49:41the uh waste of the sequencing reaction
- 00:49:44will go here and you have to clean this
- 00:49:46uh cartridge after the sequencing
- 00:49:48reaction is complete so that's how you
- 00:49:50have to you have to be very uh cautious
- 00:49:53while checking all the things are in
- 00:49:55place and once is
- 00:49:57that sorry to stop you U I think we are
- 00:50:00getting late thank you for this
- 00:50:03demonstration um yeah so broadly now you
- 00:50:06can see a lot of data can be generated
- 00:50:08but how to analyze the data so timing is
- 00:50:10bit short but we will give you at least
- 00:50:13some flavor of uh looking at for the MPE
- 00:50:16data and MPE data analysis can be very
- 00:50:19complicated and the software which are
- 00:50:22currently available they can take you
- 00:50:24like you know good Ram of the inst your
- 00:50:26MPE and it can take few days time but a
- 00:50:30new software which is developed from
- 00:50:32academic Lab Dr Alex's lab uh in
- 00:50:35nashille uh in US Ms Fragger it is one
- 00:50:38of the very highly recommended software
- 00:50:40for you guys to try out even in know own
- 00:50:43laptop you can run some you know complex
- 00:50:45MP data and analyze that so ainash is
- 00:50:48going to give you very quickly just the
- 00:50:51kind of software
- 00:50:52interface uh because we don't have time
- 00:50:54to run the data right now uh but again
- 00:50:56if you have really interest you can
- 00:50:58always ask us we'll be happy to have a
- 00:50:59separate session for you for the data
- 00:51:01analysis part but ainash please uh
- 00:51:04quickly give a demonstration of that
- 00:51:05software yeah so sir can you unare your
- 00:51:10screen yeah thank
- 00:51:19you so uh this is a like sir I already
- 00:51:22mentioned that so this is a your first
- 00:51:24like after opening the software so this
- 00:51:26is the frag pipe 22 version so you can
- 00:51:29easily download this software by using
- 00:51:31going in the Google and just type the
- 00:51:32frag pipe you will easily uh get the all
- 00:51:35the tutorials and the downloading links
- 00:51:37so after downloading the links you will
- 00:51:39get this kind of interface and this so
- 00:51:41here we have to install the multiple
- 00:51:44like like plugins for this software so
- 00:51:48like Ms Fragger iron cont and D rure for
- 00:51:51the different kind of analysis and the
- 00:51:52Dian and the pythons and the and also
- 00:51:54the for the different software for the
- 00:51:56analysis so now we will go for the like
- 00:51:59workflow like how we do the analysis so
- 00:52:01after downloading all the the sessions
- 00:52:03you can just click on the download and
- 00:52:05you can download all the sessions after
- 00:52:07downloading you can go to the workflow
- 00:52:09and you can if so for this analysis I'm
- 00:52:11just showing for the DDA analysis and so
- 00:52:13this is the list of the different kind
- 00:52:15of workflow has been given for the all
- 00:52:17kind of analysis like DDA Dia tmtn you
- 00:52:19can get all kind of data set So
- 00:52:21currently I'm using the lfq MBR for the
- 00:52:23DD files so just click on the lfq MBR
- 00:52:25and you can click click on the load
- 00:52:27workflow and it will get already loaded
- 00:52:29and will load the all the parameters so
- 00:52:32after that you can add the few files so
- 00:52:34I already added here so you can just
- 00:52:36click on this Ms data type you can just
- 00:52:38click on ADD TI file and you can
- 00:52:40download you can just select all the
- 00:52:42files and you can upload here so U after
- 00:52:45that after downloading the file it would
- 00:52:47by default take the data type if it is
- 00:52:50not taking the data type by the by
- 00:52:52default you can just click on that and
- 00:52:54you can change the data type and also
- 00:52:56you can also add the different
- 00:52:58biological group and so in this case we
- 00:53:01have taken the three control and the
- 00:53:02three tumor sample so likewise I have
- 00:53:04annotated so you can just click on the
- 00:53:07those samples and you can change by
- 00:53:09using the uh conjugative Way by file
- 00:53:12name and also you can do by the custom
- 00:53:14way so there are multiple way of doing
- 00:53:16this things so you can uh do do this
- 00:53:18part in the same way you can uh if you
- 00:53:20have the bio replicate and Technical
- 00:53:23technical replicate you can mention this
- 00:53:25part so this is the first part of
- 00:53:26loading the database and after that you
- 00:53:28can directly go to the uh database part
- 00:53:31and you if you uh if you want to like uh
- 00:53:34download or you can upload so there are
- 00:53:36multiple way to uh upload your database
- 00:53:38so some of the few database has been
- 00:53:40already given for the human must mular
- 00:53:42sensor and different other species and
- 00:53:45also it was give the checkbox for the
- 00:53:47review sequence add decoy and the add
- 00:53:49common contamin so for this study this
- 00:53:52is the human cell line so I'm just using
- 00:53:54this by default one and I can just I
- 00:53:56will just click on the okay so after
- 00:53:58that it will be get downloaded and
- 00:54:00suppose if you want to add your own
- 00:54:02database you can just browse and you can
- 00:54:04add your database here and after that
- 00:54:06the next part is basically the all the
- 00:54:08parameter the major parameter we set for
- 00:54:10the analysis so the so you have already
- 00:54:12selected the lfq MBR workflow and but if
- 00:54:15you want to change a few parameters like
- 00:54:17missing cleavage and the enzyme type you
- 00:54:19can change this part so this is the like
- 00:54:22showing the precursor M tolerance and
- 00:54:24the fragment M tolerance so the by
- 00:54:26default it give the 20 if you want you
- 00:54:28can change according to your criteria so
- 00:54:31and also you can also add like what kind
- 00:54:33of tiation has been done so there are
- 00:54:35different kind of drop down has been
- 00:54:37given you can change your enzymes and
- 00:54:39thetic cavage part and this section
- 00:54:41basically you can add your modification
- 00:54:43so by default they add like three
- 00:54:45modification Al so two variable
- 00:54:46modification and one fixed modification
- 00:54:49so one is the oxidation methine and the
- 00:54:52anal acation and the one another is a
- 00:54:55fixed modification have to be quick yeah
- 00:54:57yeah yeah so after that you can just
- 00:54:59directly go to the qu Tams one you can
- 00:55:01check just click on this part and if you
- 00:55:03want to change anything you can change
- 00:55:05here or otherwise you can just directly
- 00:55:07go to the run and go to the browse and
- 00:55:09select the directory where you want to
- 00:55:10save the file and you can just start and
- 00:55:13it will uh just click on the run so it
- 00:55:15will get start running this way and
- 00:55:17after the finishing this job it will
- 00:55:18show this kind of interface to your job
- 00:55:20is done and after running this file you
- 00:55:22will get the output like this so this
- 00:55:25will be output file and you will get the
- 00:55:27combined protein peptide and anotation
- 00:55:28file from this data sets now we will
- 00:55:31move for the like we are rushing for the
- 00:55:33time so we will move to the further
- 00:55:34analysis so you can so currently they
- 00:55:37have added the downstream analysis
- 00:55:40analysis software as well which is the
- 00:55:42track pipe anal if you just click on
- 00:55:43this it will directly open up so I've
- 00:55:46already opened up this software so it
- 00:55:48the interface will be look like this so
- 00:55:50yeah so this will be interface for the
- 00:55:52frag type analyst you can just click on
- 00:55:54the analysis you can select your data
- 00:55:56type so our data type is lfq so we will
- 00:55:58select that and if you can just browse
- 00:56:00the what whatever the file was given so
- 00:56:03you have to select the combined protein
- 00:56:06folder combined protein so it will
- 00:56:09already get uploaded after that you have
- 00:56:11to give The annotation file so this is
- 00:56:13our experimental annotation and after
- 00:56:16that so yes is get downloaded here you
- 00:56:18can select the max lfq intensity and if
- 00:56:21you want to the change like various
- 00:56:23other parameters like missing values and
- 00:56:25the P value criteria and the full change
- 00:56:27criteria and type of normalization so
- 00:56:29you can check on that and also there are
- 00:56:31different type of imputation method has
- 00:56:33been given so Pur type K so those are
- 00:56:36the basically commonly used methods and
- 00:56:39also the Benjamin HB for the FDA
- 00:56:42correction and you just click on the run
- 00:56:44so yeah it will just take the few
- 00:56:46seconds to complete this part and uh it
- 00:56:48will show like all the QC parts so yeah
- 00:56:51as you can see so the first part to
- 00:56:53check the like uh to so these are the
- 00:56:56the basically QC plot to check the data
- 00:56:58quality so you can look for the PCA plot
- 00:57:00sample correlation sample CVS and the
- 00:57:02density plot and the missing value heat
- 00:57:04map and you can also so overall you can
- 00:57:07check all the type of QC plot from the
- 00:57:09this data sets and after that you can
- 00:57:11get the volcano plot and uh yeah so uh
- 00:57:16so as you can see that in this there it
- 00:57:19is showing a four five so we have not
- 00:57:21added the missing value of criteria so
- 00:57:22it will give you the list so you can
- 00:57:24just sort on the basis of the like C
- 00:57:26change criteria and uh like what F
- 00:57:28change com so it is also giving the
- 00:57:30volcano plot and you can download all
- 00:57:32the plots and the tables which has been
- 00:57:34given you can also add like what kind of
- 00:57:37En enrichment you want so and also it if
- 00:57:40you want you can look for the pathway
- 00:57:41analysis this part so this is a very
- 00:57:43convenient pathway like convenient like
- 00:57:46for the all the for the downst strring
- 00:57:48analysis and we can get the de is from
- 00:57:50this and after that we will go for the
- 00:57:52path analysis which like anit will talk
- 00:57:56about that so that's all all right great
- 00:57:59so uh thank you anash I think it was yes
- 00:58:02quick demo but what users and the the
- 00:58:05student who are joining today they can
- 00:58:07see that even looking at uh please stop
- 00:58:10sharing the screen yeah yes even uh now
- 00:58:13analyzing the complex mpect data is very
- 00:58:16much possible and earlier you know we
- 00:58:18used to then look at different software
- 00:58:20uh to do the plotting of the data
- 00:58:22especially the u ms FR like you know
- 00:58:25metab list and percs many other
- 00:58:29software U but now I think you know you
- 00:58:31have advantage that you can use the MS
- 00:58:33frager and generate the from the same
- 00:58:36software you can look at very simp of
- 00:58:37output as well and very fast
- 00:58:40Additionally you can always even do more
- 00:58:42of the protog genomics data analysis
- 00:58:43with the c bioportal and we just inst
- 00:58:46you have already learned some of these
- 00:58:47tools in the protog genomics course if
- 00:58:49you're doing that for students of the
- 00:58:51proteomic you can know that there's a
- 00:58:53word Beyond this proteomic when you can
- 00:58:55now start thinking about integrating
- 00:58:57data and make it more multix you know
- 00:59:00kind of picture and U given the short
- 00:59:03time we will be uh skipping the demo of
- 00:59:06other protog genomics tool U but what
- 00:59:09you have to really uh understand about
- 00:59:11you really want to emphasize that now
- 00:59:13there is a field which is know as multix
- 00:59:15or integrated omx which is now driving
- 00:59:18the systems based approach of
- 00:59:20understanding any complex system much
- 00:59:22more
- 00:59:23comprehensively that's the field which
- 00:59:25is going to really make the difference
- 00:59:26because in in our actual physiology
- 00:59:30whether it's human or other kind of
- 00:59:31living system we don't have the gene
- 00:59:34protein RNA metabolite they don't work
- 00:59:36separately or independently they all
- 00:59:38work kind of like you know in in a very
- 00:59:41relay race where every partner is very
- 00:59:44important right so when we look at only
- 00:59:46one side of the picture only genome or
- 00:59:48only transcriptome or only proteum we
- 00:59:50miss out the complete picture therefore
- 00:59:53uh what is now much more with the robust
- 00:59:55Technologies and instuments available
- 00:59:57but is now much more possible to look at
- 01:00:00different layer of information from the
- 01:00:01same type of sample and that's really
- 01:00:03more important for the complex disease
- 01:00:05like you know for various type of human
- 01:00:06cancer diabetes heart diseases now
- 01:00:09people are trying to look at layer by
- 01:00:11layer information and then now what you
- 01:00:13realize is that since weting so much
- 01:00:15amount of data it can't be only biology
- 01:00:18project now we talking about the big
- 01:00:19data so therefore people from the
- 01:00:21statistical background becomes very
- 01:00:23important to look at the significance of
- 01:00:25the data people from the core
- 01:00:27computational background becomes
- 01:00:28important to look at the entire
- 01:00:31programming and even from the chemical
- 01:00:33background to look at the simulation of
- 01:00:35the data and come up with the much more
- 01:00:36systems modeling to understand a given
- 01:00:39system much more comprehensively so all
- 01:00:41this is really you know moving in the
- 01:00:43direction of what we see like you know
- 01:00:44the new era with the artificial
- 01:00:46intelligence machine learning to really
- 01:00:48make much more predictions but
- 01:00:50predictions will only come if we have
- 01:00:51the robust set of the data to build the
- 01:00:53models so therefore right now this kind
- 01:00:55of field is really uh playing an
- 01:00:58important role to generate very
- 01:01:00meaningful information and data sets
- 01:01:02which can be utilized to make very
- 01:01:04important hypothesis which can be
- 01:01:07further tested with the specific type of
- 01:01:09experiments I hope you got kind of a
- 01:01:11gist of various technology which uh you
- 01:01:13have been studying uh in bits and pieces
- 01:01:15in the course but at least you had the
- 01:01:17firstand opportunity today from the lab
- 01:01:20to see some of these instruments and the
- 01:01:22workflows so with this let me thank all
- 01:01:24the t uh uh who have really given you
- 01:01:28nice demonstration very limited time
- 01:01:29what we had and uh also uh they have
- 01:01:33been following upon all of your queries
- 01:01:35on the Forum and giving you you know
- 01:01:38some meaningful comments on your various
- 01:01:40type of queries uh so thanks to entire
- 01:01:43team and of course thanks to all of you
- 01:01:45the students who are really showing good
- 01:01:47interest into the course are asking some
- 01:01:50very interesting questions to us and lot
- 01:01:51of query which comes to us and hopefully
- 01:01:54like some of you might be liking this
- 01:01:56field to utilize it for your own
- 01:01:57research or to take up the future job
- 01:02:00opportunities and career opportunities
- 01:02:01many student ask us these questions like
- 01:02:03what kind of future opportunities we may
- 01:02:05have in this area and of course you know
- 01:02:07given the significance of this field uh
- 01:02:10you will expect that okay you are going
- 01:02:12to get lot of uh job opportunities in
- 01:02:15this area because people who can uh
- 01:02:17analyze these type of data sets will
- 01:02:19been big demand uh with this I think we
- 01:02:22will close today's live session which
- 01:02:24was our second live session for this
- 01:02:26smuk course and we really wish all of
- 01:02:29you the best for your writing exams if
- 01:02:32you do well in your uh this Muk course
- 01:02:36you will have opportunity to come to it
- 01:02:37Bombay uh spend some time in our lab or
- 01:02:40do some internship or attend some
- 01:02:41Workshop so uh the journey does not end
- 01:02:45here with this kind of you know the
- 01:02:46video courses but rather we want to have
- 01:02:49to see some of you more live in the lab
- 01:02:52and just you know on the lighter note
- 01:02:53there has been several move propers who
- 01:02:55have really you know eventually joined
- 01:02:57our lab and I have been really you know
- 01:02:59very proud to see their career
- 01:03:01progression and our interaction with
- 01:03:02them that they were virtual students who
- 01:03:05you know then perform very well they
- 01:03:07came to join our lab as a project
- 01:03:09student or interns eventually they got
- 01:03:11into the PHD programs or M tech programs
- 01:03:14they finished the programs here and now
- 01:03:15they're doing very well for their post
- 01:03:16do and jobs in Us and other places so it
- 01:03:19has been really you know remarkable uh
- 01:03:21uh positive journey and experience to
- 01:03:23see how the virtual interaction from all
- 01:03:26over India and different parts of the
- 01:03:27world brings people together and then
- 01:03:29you know we are helping them to move
- 01:03:31forward in their career trajectory so we
- 01:03:34wish all of you the best and we'll be
- 01:03:35very much available to take any other
- 01:03:38query and your other kind of future uh
- 01:03:41whatever you know apprehensions you may
- 01:03:43have and questions you may have to give
- 01:03:45you much more you know meaningful
- 01:03:46insights of your query with this let me
- 01:03:49thank the entire team and close today's
- 01:03:51session okay bye everybody
- multiomics
- proteomics
- single-cell analysis
- biological research
- data integration
- genomics
- metabolomics
- proteogenomics
- mass spectrometry
- big data