00:00:02
hi this is a demo of big id's data
00:00:05
intelligence platform focused on catalog
00:00:07
capabilities
00:00:09
big id is the data intelligence platform
00:00:11
to know and manage your data big id is
00:00:14
unique because the platform is built on
00:00:15
a core discovery foundation with machine
00:00:17
learning to know your data
00:00:21
big-id is also unique because the
00:00:22
platform sits at the intersection of
00:00:24
privacy security and governance
00:00:27
and it all starts with a machine
00:00:28
learning augmented catalog
00:00:30
for example big id tackles governance
00:00:33
using machine learning for deep
00:00:35
discovery to add context at scale to
00:00:37
know what your data is and how it is
00:00:39
related and connected to other data
00:00:40
sources in your environment
00:00:43
because you can't govern data unless you
00:00:45
know what your data is and where it is
00:00:47
and what it means or you can't govern
00:00:49
what you don't know
00:00:50
to know your data big id adds
00:00:52
classifiers and tags to identify all of
00:00:55
your data objects
00:00:56
customers can automatically apply
00:00:58
attributes that add context tag objects
00:01:00
that include sensitive information and
00:01:02
classify objects by data type and even
00:01:04
custom sensitivity levels
00:01:07
the big-id catalog is privacy aware by
00:01:09
identifying sensitive information and
00:01:11
even going a step deeper to add policy
00:01:13
tags and notifications for regulation
00:01:14
compliance
00:01:16
with big id data users can see what
00:01:18
their data is if it is sensitive and if
00:01:20
they are at risk of violating any
00:01:21
policies like ccpa or gdpr
00:01:24
bridging into security
00:01:26
the big id catalog shows what data
00:01:28
objects are duplicates or have open
00:01:30
access
00:01:31
this is important for risk minimization
00:01:33
to identify and remediate any data that
00:01:35
is duplicate or sensitive data that has
00:01:37
open access
00:01:38
big id say the catalog provides that
00:01:40
insight to take action to remediate and
00:01:42
then audit to ensure the action was
00:01:44
taken all within the big id platform
00:01:50
for a visual representation of big id
00:01:52
this image illustrates how big id is
00:01:54
reimagining the machine learning data
00:01:56
catalog
00:01:57
the base layer here illustrates all of
00:01:59
the big-id connectors to connect your
00:02:01
data
00:02:02
big-id's data intelligence platform
00:02:04
connects to any and all data source so
00:02:07
structured or unstructured
00:02:08
semi-structured data the data can live
00:02:10
in cloud on-prem and hybrid environments
00:02:13
and even scanning data pipelines
00:02:15
this is a huge differentiator for big id
00:02:17
because the platform truly connects to
00:02:19
any data source
00:02:20
it is unique to be able to manage
00:02:22
structured data with unstructured data
00:02:24
for your complete environment imagine
00:02:26
being able to scan and gain intelligence
00:02:28
about all the files emails s3 buckets
00:02:31
salesforce notes and more in a single
00:02:33
platform
00:02:34
the middle layer is the discovery in
00:02:37
depth this is what big id calls its four
00:02:39
cs and these are four lenses catalog
00:02:42
classify correlate and cluster
00:02:45
lenses to discover define and know your
00:02:47
data to know what your data is and how
00:02:49
it's related to other data in your
00:02:50
environment and that way we can apply
00:02:53
deeper insight and understanding
00:02:56
the top layer is how you action your
00:02:58
data this represents the layer of apps
00:03:00
on top of the platform
00:03:02
and you can choose which apps you want
00:03:03
to apply for example add a data quality
00:03:06
app to measure and manage the quality of
00:03:08
your data and then display those data
00:03:09
quality scores in the catalog for users
00:03:12
to choose data assets with context and
00:03:15
quality
00:03:16
or maybe add a data retention app to set
00:03:18
retention policies to know when data
00:03:20
should be archived or deleted or to
00:03:22
identify data that must be saved for a
00:03:24
legal hold
00:03:25
the apps i'll integrate with the
00:03:26
platform to provide control from a
00:03:28
single point to consistently manage all
00:03:30
of your data and audit actions and tasks
00:03:33
the apps include a variety of
00:03:35
out-of-the-box functions and automation
00:03:37
while enabling customization so for
00:03:39
example you might want to set custom
00:03:41
policies based on priorities for your
00:03:42
organization
00:03:48
these are a series of critical
00:03:50
differentiators for big id we'll focus
00:03:53
on just a couple
00:03:54
first um let's notice that the coverage
00:03:56
of big id is broad so that means that
00:03:59
big id can
00:04:01
can scan and add context to any kind of
00:04:04
data source that's structured or
00:04:06
unstructured in the cloud or on-prem so
00:04:08
data at rest or emotion any kind of data
00:04:10
that you have in your environment big id
00:04:12
can automatically scan and classify and
00:04:15
put into a single catalog view so you
00:04:17
can manage all of your data in one view
00:04:19
with all consistent um consistent rules
00:04:22
and betty has
00:04:24
differentiator in context that we can
00:04:26
discover in depth is the core platform
00:04:29
and so we have customized views to know
00:04:31
your data big id is going to add
00:04:33
different kinds of machine learning and
00:04:34
even has a patented machine learning
00:04:36
engine to know your data with more
00:04:38
context see how split in your
00:04:40
environment and discover insights that
00:04:42
you wouldn't have otherwise had
00:04:44
that's interesting let's transition over
00:04:46
to the live demo and talk about some of
00:04:48
the rest of these differentiators
00:04:56
okay now we are in the big id platform
00:04:58
as you can see this is the dashboard
00:05:00
home screen you can see and some sources
00:05:03
here you can click through you can also
00:05:05
see the physical location of data in the
00:05:07
environment
00:05:08
the first thing we're going to talk
00:05:09
about here in the demo is the content
00:05:12
coverage so as i mentioned big id is
00:05:15
able to find all of your data sources
00:05:17
and types so it could be um structured
00:05:20
or unstructured data it could be on the
00:05:21
cloud on-prem wherever it is big id can
00:05:24
connect to it and add it to the catalog
00:05:27
with automated scanning and discovery so
00:05:31
um you can see here in our data
00:05:32
environment we have a whole selection of
00:05:34
a variety of data sources and types and
00:05:38
i can also very easily add a new data
00:05:40
source
00:05:41
here are a selection of um
00:05:44
pre-configured connectors to make it
00:05:46
super easy to connect your data and as
00:05:47
you can see we have a whole variety of
00:05:50
different data sources and types on-prem
00:05:53
cloud structured unstructured data it
00:05:55
all exists with a very easy to use
00:05:57
connection to import all of your data
00:05:59
into the catalog for a single view of
00:06:02
your data environment
00:06:04
next up we're talking about connectors
00:06:06
profiling and lineage so big id has
00:06:09
native functionality to visualize how
00:06:11
data is connected i'm sure you're all
00:06:13
aware that as far as lineage there are
00:06:16
your lineage providers that have very
00:06:20
sophisticated lineage tools and big id
00:06:23
is able to ingest that lineage content
00:06:26
and then add some um
00:06:28
add context to lineage so now you can
00:06:30
see that lineage with context in big id
00:06:34
another way you can think about lineage
00:06:35
here is with this data processing and
00:06:37
sharing here's just an example of how we
00:06:40
share some lineage and big id about just
00:06:43
how data flows so for example these are
00:06:45
all processes where we know that data is
00:06:47
moving you know from place to place
00:06:48
within an organization and i'm such a
00:06:50
data and map of how data is flowing
00:06:53
through an organization it's just
00:06:54
another kind of view of lineage in terms
00:06:57
of processing and data flows
00:07:01
for profiling the way that we do data
00:07:03
profiling is i can use the catalog
00:07:08
and i can see what kind of um you know
00:07:10
what my data looks like so for example
00:07:14
um let's look here at these attributes
00:07:17
name so i want to look at everything
00:07:19
that big id has determined has the
00:07:21
attribute of name and you can see here i
00:07:23
have a whole variety of different data
00:07:25
sources where name came up which is not
00:07:27
surprising so if i look at this for
00:07:29
example marketing
00:07:32
i can see that right away if i'm looking
00:07:34
for um
00:07:35
you have some profiling information just
00:07:37
to know about this data i can see right
00:07:39
away this marketing contains pi that
00:07:41
means it contains some sensitive
00:07:42
information i can see it was tagged as
00:07:44
high risk i can see more information
00:07:46
that was basically tagged
00:07:48
directly from big id with automated
00:07:50
scanning and context i can see the
00:07:53
attributes so then this marketing data
00:07:56
object i can see it has these different
00:07:57
attributes connected with
00:07:59
different classifiers for countries and
00:08:01
addresses and names which is also had to
00:08:04
be expected in a marketing database
00:08:06
i can see different columns that are
00:08:09
included in this data set so city and
00:08:12
country i can see what type of
00:08:14
characters i can see the attributes and
00:08:16
tags that they are applied here
00:08:19
and then i can see even a preview of the
00:08:21
data to get an idea about what the data
00:08:23
looks like to determine if it's what i'm
00:08:24
really looking for in my data and what
00:08:27
um what the true data
00:08:29
example is i can also interact with the
00:08:31
data another way if i see that there's
00:08:33
something here that doesn't look quite
00:08:34
right to me i can collaborate with the
00:08:36
data owners and i can report an issue
00:08:38
about it to get resolved or i can set
00:08:41
myself here as a watcher of the data to
00:08:42
get alerts for everything changes with
00:08:45
this data set that i am interested in
00:08:48
for data curation here we're going to
00:08:50
add context data the way that big id
00:08:52
does that is kind of our core function
00:08:55
right so we have correlation
00:08:57
um and in correlation we're going to add
00:08:59
context by seeing
00:09:01
where the
00:09:02
where data is um
00:09:05
is related to other data in our
00:09:07
environment so for example here in our
00:09:09
email suggestion we have
00:09:12
email here medical and i can see that it
00:09:15
is here in our patient information and
00:09:17
in our notes information this is letting
00:09:19
me know that the this email attribute is
00:09:23
related to these different um
00:09:26
these different data sources as well and
00:09:28
that i can click through to more of that
00:09:30
too it's all interactive
00:09:32
the way that we're going to interact
00:09:33
with our data is here in the cluster
00:09:36
analysis so i can get more curation i
00:09:38
can get more context from my data here
00:09:40
and cluster analysis by being able to
00:09:42
see what data is duplicate so for
00:09:45
example in this employee information
00:09:47
this is all groups of similar data i can
00:09:50
see based on how big the bubble is and
00:09:52
how far away it is from other bubbles
00:09:54
how closely related it is to other data
00:09:57
types and sources and then if i want to
00:09:58
click into my cluster analysis i can
00:10:01
look into the customer to click into the
00:10:03
cluster and i can see um
00:10:05
you know some let me see queries i can
00:10:07
see top keywords attributes i can see
00:10:10
objects for data source i can get more
00:10:11
information about the things that live
00:10:13
in that in that bubble things that are
00:10:16
similar information which is important
00:10:17
to know if i'm looking for something to
00:10:19
know if i have duplicate data so i know
00:10:21
that i'm using the right one or if i'm
00:10:23
doing a cleanup or if i'm doing a cloud
00:10:25
migration for example to know what data
00:10:26
to to migrate to know what is in my
00:10:28
environment and what i need to to know
00:10:31
about it
00:10:32
i want to show you also classification
00:10:35
big id is going to analyze your data not
00:10:37
just from the metadata but the actual
00:10:39
physical data we're going to analyze
00:10:41
that and determine what it is and then
00:10:43
we're going to add classifier so you can
00:10:44
use a whole selection of big id out of
00:10:47
the box classifications classifiers or
00:10:49
you can set your own custom classifiers
00:10:52
based on what is important
00:10:54
in your organization
00:10:58
we're going to talk next about the
00:11:00
search functionality once we have all of
00:11:02
this data and it's all been
00:11:04
um reported into it's all in found a big
00:11:07
id
00:11:08
i can search here in the catalog
00:11:10
and find that data that i'm looking for
00:11:12
i can either search using these quick
00:11:14
filters here at this side or i can
00:11:16
search using a quick
00:11:18
keyword here i can go back and see my
00:11:21
recently viewed object so i want to
00:11:22
continue working on a project maybe i
00:11:24
was working on earlier or i can type in
00:11:27
a keyword and see and look for some
00:11:29
account as my keyword i can see policies
00:11:32
and policies tables files to see
00:11:35
all of the different
00:11:37
different data objects or policies or
00:11:40
data um the objects that exist in big id
00:11:43
related to my keyword search
00:11:47
for data access
00:11:50
again if we go back to the catalog um
00:11:54
and we look back at any of that
00:11:57
information so for example you're back
00:11:59
in that marketing
00:12:02
we're looking at earlier today i can go
00:12:04
here to marketing and i can x at any
00:12:07
point you'll see this export is here and
00:12:08
i can export data to um to have a report
00:12:11
of the information i'm looking at so if
00:12:13
i'm looking at the os of columns i can
00:12:15
export and i can see that to access to
00:12:18
to use it to integrate with other
00:12:20
solutions as well
00:12:22
next on our list we talk about privacy
00:12:23
controls so we have privacy
00:12:26
notifications just like here contains pi
00:12:28
that just means it contains some
00:12:30
sensitive information depending on how
00:12:32
you define that um privacy controls
00:12:34
means that i can see what data is
00:12:36
sensitive so i can determine how to
00:12:38
protect it who should have access to it
00:12:40
for example um we can do
00:12:43
things like connecting externally to
00:12:45
snowflake and in snowflake we have a way
00:12:48
to say this data was determined to be
00:12:50
sensitive in big id which means that in
00:12:52
snowflake you may need to make sure
00:12:54
we can set some role based access
00:12:57
control and make sure that
00:12:59
only people who should see the data are
00:13:01
able to see it and everyone else if
00:13:03
you're asked to see that data you get
00:13:04
masking so you get masked data and it
00:13:07
happens dynamically so as um imagine
00:13:10
this data is flowing into your snowflake
00:13:11
environment being able to determine what
00:13:13
is sensitive who to have access and then
00:13:15
having that all enabled automatically
00:13:17
which is great because it means you're
00:13:19
going to enable people who should have
00:13:20
access to do very fast access for
00:13:23
analytics and you're going to
00:13:25
make sure your data is protected people
00:13:27
who don't need to have access shouldn't
00:13:28
see it
00:13:30
you also have access controls within big
00:13:32
id so under administration
00:13:34
you can see here we have access
00:13:36
management and that means we're able to
00:13:38
add different users as we add users we
00:13:41
can also assign them a role and then we
00:13:43
can go to their roles and determine
00:13:45
based on their role
00:13:47
we can define
00:13:48
at a very granular level what that user
00:13:51
is able to do or not do it allows us to
00:13:54
kind of give groups of users similar
00:13:56
roles and similar access to have similar
00:13:59
rights within the system for the big id
00:14:02
application itself
00:14:05
finally pricing and support
00:14:07
so on support um big id offers
00:14:09
award-winning customer support we have
00:14:11
staff around the globe we offer
00:14:13
continuous coverage for our customers as
00:14:16
far as pricing pricing is going to be
00:14:18
based on bands of data sources with apps
00:14:21
so
00:14:22
packages are really customized based on
00:14:24
your needs um you can
00:14:26
have more or less data connected and you
00:14:29
can have more or less applications so
00:14:31
these are all the different kinds of
00:14:32
applications that you can choose from
00:14:34
and you can also browse the marketplace
00:14:36
to find more applications that you might
00:14:38
want to add to take action on your data
00:14:40
in big id to learn more about any of
00:14:42
these functions i recommend that you
00:14:44
contact us at bigid.com
00:14:46
and in summary i want to leave you with
00:14:48
the idea that big id is the platform to
00:14:51
know your data it's based on a core
00:14:52
discovery we can connect to all your
00:14:54
data sources structured unstructured
00:14:56
on-prem in the cloud doesn't matter
00:14:58
where it is or what it is we can connect
00:15:00
it all in big id you can see it all in
00:15:02
the catalog and this is really
00:15:04
significant because if you imagine
00:15:06
trying to do all of this manually
00:15:07
discovering your data attacking it i'm
00:15:09
applying policies being able to
00:15:11
determine what is sensitive if you try
00:15:12
to do it manually it's almost impossible
00:15:15
with the rate of data growth and change
00:15:17
that we're seeing so that's why everyday
00:15:19
that's why big id is reimagining data
00:15:22
governance with a data catalog that
00:15:23
connects to all our data sources we
00:15:25
deliver automated scanning and we add
00:15:27
context at scale to know your data
00:15:30
have any more questions we'd be happy to
00:15:32
talk with you please contact us or check
00:15:34
out bigid.com for more information and
00:15:37
thanks for watching
00:15:40
you