BigID Data Catalog Demo

00:15:41
https://www.youtube.com/watch?v=JHOkyYOSoq0

摘要

TLDRBigID is a data intelligence platform designed to help organizations discover, manage, and govern their data effectively. It leverages machine learning to provide deep insights into data relationships and classifications, ensuring compliance with privacy regulations. The platform connects to various data sources, both structured and unstructured, and offers automated scanning and tagging capabilities. BigID's catalog allows users to visualize their data environment, identify sensitive information, and apply governance policies. With features like customizable applications and robust support, BigID aims to simplify data management in an increasingly complex data landscape.

心得

  • 🔍 BigID is a data intelligence platform for managing data.
  • 📊 It uses machine learning for deep data discovery.
  • 🔒 The platform ensures compliance with regulations like CCPA and GDPR.
  • 🌐 BigID connects to all data sources, structured and unstructured.
  • 📈 Automated scanning adds context to data at scale.
  • 🗂️ The catalog provides a single view of the data environment.
  • 🔑 Users can apply classifiers and tags for sensitive information.
  • ⚙️ Customizable applications enhance data management capabilities.
  • 📉 BigID helps minimize risks associated with data access.
  • 🤝 Award-winning support is available for users.

时间轴

  • 00:00:00 - 00:05:00

    The video introduces BigID's data intelligence platform, emphasizing its unique capabilities in data cataloging, privacy, security, and governance. It highlights the importance of machine learning in discovering and managing data, allowing users to classify, tag, and understand their data objects effectively. The platform's ability to identify sensitive information and ensure compliance with regulations like CCPA and GDPR is also discussed, showcasing how it aids in data governance by providing insights into data relationships and risks.

  • 00:05:00 - 00:10:00

    In the demo, the BigID platform is showcased, demonstrating its user-friendly dashboard and the ability to connect to various data sources, both structured and unstructured. The platform's automated scanning and discovery features are highlighted, allowing users to visualize data lineage and profiling. The demo also covers how users can interact with data, report issues, and collaborate with data owners, emphasizing the platform's capabilities in data curation and classification, as well as its search functionality for easy data access.

  • 00:10:00 - 00:15:41

    The video concludes with a summary of BigID's offerings, including privacy controls, access management, and customer support. It emphasizes the platform's ability to connect to all data sources, automate data discovery, and provide context at scale, making data governance manageable in the face of rapid data growth. Viewers are encouraged to contact BigID for more information, reinforcing the platform's role in transforming data governance.

思维导图

视频问答

  • What is BigID?

    BigID is a data intelligence platform focused on data catalog capabilities, utilizing machine learning for data discovery and governance.

  • How does BigID ensure data governance?

    BigID uses machine learning to add context to data, helping organizations understand what their data is, where it is, and how it relates to other data.

  • What types of data can BigID manage?

    BigID can manage structured, unstructured, and semi-structured data across cloud, on-prem, and hybrid environments.

  • What are the key features of BigID?

    Key features include automated scanning, data classification, privacy controls, and customizable applications for data management.

  • How does BigID help with compliance?

    BigID identifies sensitive information and applies policy tags to ensure compliance with regulations like CCPA and GDPR.

  • Can BigID connect to various data sources?

    Yes, BigID connects to any data source, allowing for a comprehensive view of an organization's data environment.

  • What is the significance of the BigID catalog?

    The BigID catalog provides a single view of all data, enabling organizations to manage and govern their data effectively.

  • How does BigID use machine learning?

    BigID employs machine learning to enhance data discovery, classification, and context, allowing for deeper insights into data relationships.

  • What support does BigID offer?

    BigID provides award-winning customer support with global coverage for its users.

  • How is BigID priced?

    Pricing is based on the number of data sources and applications needed, allowing for customized packages.

查看更多视频摘要

即时访问由人工智能支持的免费 YouTube 视频摘要!
字幕
en
自动滚动:
  • 00:00:02
    hi this is a demo of big id's data
  • 00:00:05
    intelligence platform focused on catalog
  • 00:00:07
    capabilities
  • 00:00:09
    big id is the data intelligence platform
  • 00:00:11
    to know and manage your data big id is
  • 00:00:14
    unique because the platform is built on
  • 00:00:15
    a core discovery foundation with machine
  • 00:00:17
    learning to know your data
  • 00:00:21
    big-id is also unique because the
  • 00:00:22
    platform sits at the intersection of
  • 00:00:24
    privacy security and governance
  • 00:00:27
    and it all starts with a machine
  • 00:00:28
    learning augmented catalog
  • 00:00:30
    for example big id tackles governance
  • 00:00:33
    using machine learning for deep
  • 00:00:35
    discovery to add context at scale to
  • 00:00:37
    know what your data is and how it is
  • 00:00:39
    related and connected to other data
  • 00:00:40
    sources in your environment
  • 00:00:43
    because you can't govern data unless you
  • 00:00:45
    know what your data is and where it is
  • 00:00:47
    and what it means or you can't govern
  • 00:00:49
    what you don't know
  • 00:00:50
    to know your data big id adds
  • 00:00:52
    classifiers and tags to identify all of
  • 00:00:55
    your data objects
  • 00:00:56
    customers can automatically apply
  • 00:00:58
    attributes that add context tag objects
  • 00:01:00
    that include sensitive information and
  • 00:01:02
    classify objects by data type and even
  • 00:01:04
    custom sensitivity levels
  • 00:01:07
    the big-id catalog is privacy aware by
  • 00:01:09
    identifying sensitive information and
  • 00:01:11
    even going a step deeper to add policy
  • 00:01:13
    tags and notifications for regulation
  • 00:01:14
    compliance
  • 00:01:16
    with big id data users can see what
  • 00:01:18
    their data is if it is sensitive and if
  • 00:01:20
    they are at risk of violating any
  • 00:01:21
    policies like ccpa or gdpr
  • 00:01:24
    bridging into security
  • 00:01:26
    the big id catalog shows what data
  • 00:01:28
    objects are duplicates or have open
  • 00:01:30
    access
  • 00:01:31
    this is important for risk minimization
  • 00:01:33
    to identify and remediate any data that
  • 00:01:35
    is duplicate or sensitive data that has
  • 00:01:37
    open access
  • 00:01:38
    big id say the catalog provides that
  • 00:01:40
    insight to take action to remediate and
  • 00:01:42
    then audit to ensure the action was
  • 00:01:44
    taken all within the big id platform
  • 00:01:50
    for a visual representation of big id
  • 00:01:52
    this image illustrates how big id is
  • 00:01:54
    reimagining the machine learning data
  • 00:01:56
    catalog
  • 00:01:57
    the base layer here illustrates all of
  • 00:01:59
    the big-id connectors to connect your
  • 00:02:01
    data
  • 00:02:02
    big-id's data intelligence platform
  • 00:02:04
    connects to any and all data source so
  • 00:02:07
    structured or unstructured
  • 00:02:08
    semi-structured data the data can live
  • 00:02:10
    in cloud on-prem and hybrid environments
  • 00:02:13
    and even scanning data pipelines
  • 00:02:15
    this is a huge differentiator for big id
  • 00:02:17
    because the platform truly connects to
  • 00:02:19
    any data source
  • 00:02:20
    it is unique to be able to manage
  • 00:02:22
    structured data with unstructured data
  • 00:02:24
    for your complete environment imagine
  • 00:02:26
    being able to scan and gain intelligence
  • 00:02:28
    about all the files emails s3 buckets
  • 00:02:31
    salesforce notes and more in a single
  • 00:02:33
    platform
  • 00:02:34
    the middle layer is the discovery in
  • 00:02:37
    depth this is what big id calls its four
  • 00:02:39
    cs and these are four lenses catalog
  • 00:02:42
    classify correlate and cluster
  • 00:02:45
    lenses to discover define and know your
  • 00:02:47
    data to know what your data is and how
  • 00:02:49
    it's related to other data in your
  • 00:02:50
    environment and that way we can apply
  • 00:02:53
    deeper insight and understanding
  • 00:02:56
    the top layer is how you action your
  • 00:02:58
    data this represents the layer of apps
  • 00:03:00
    on top of the platform
  • 00:03:02
    and you can choose which apps you want
  • 00:03:03
    to apply for example add a data quality
  • 00:03:06
    app to measure and manage the quality of
  • 00:03:08
    your data and then display those data
  • 00:03:09
    quality scores in the catalog for users
  • 00:03:12
    to choose data assets with context and
  • 00:03:15
    quality
  • 00:03:16
    or maybe add a data retention app to set
  • 00:03:18
    retention policies to know when data
  • 00:03:20
    should be archived or deleted or to
  • 00:03:22
    identify data that must be saved for a
  • 00:03:24
    legal hold
  • 00:03:25
    the apps i'll integrate with the
  • 00:03:26
    platform to provide control from a
  • 00:03:28
    single point to consistently manage all
  • 00:03:30
    of your data and audit actions and tasks
  • 00:03:33
    the apps include a variety of
  • 00:03:35
    out-of-the-box functions and automation
  • 00:03:37
    while enabling customization so for
  • 00:03:39
    example you might want to set custom
  • 00:03:41
    policies based on priorities for your
  • 00:03:42
    organization
  • 00:03:48
    these are a series of critical
  • 00:03:50
    differentiators for big id we'll focus
  • 00:03:53
    on just a couple
  • 00:03:54
    first um let's notice that the coverage
  • 00:03:56
    of big id is broad so that means that
  • 00:03:59
    big id can
  • 00:04:01
    can scan and add context to any kind of
  • 00:04:04
    data source that's structured or
  • 00:04:06
    unstructured in the cloud or on-prem so
  • 00:04:08
    data at rest or emotion any kind of data
  • 00:04:10
    that you have in your environment big id
  • 00:04:12
    can automatically scan and classify and
  • 00:04:15
    put into a single catalog view so you
  • 00:04:17
    can manage all of your data in one view
  • 00:04:19
    with all consistent um consistent rules
  • 00:04:22
    and betty has
  • 00:04:24
    differentiator in context that we can
  • 00:04:26
    discover in depth is the core platform
  • 00:04:29
    and so we have customized views to know
  • 00:04:31
    your data big id is going to add
  • 00:04:33
    different kinds of machine learning and
  • 00:04:34
    even has a patented machine learning
  • 00:04:36
    engine to know your data with more
  • 00:04:38
    context see how split in your
  • 00:04:40
    environment and discover insights that
  • 00:04:42
    you wouldn't have otherwise had
  • 00:04:44
    that's interesting let's transition over
  • 00:04:46
    to the live demo and talk about some of
  • 00:04:48
    the rest of these differentiators
  • 00:04:56
    okay now we are in the big id platform
  • 00:04:58
    as you can see this is the dashboard
  • 00:05:00
    home screen you can see and some sources
  • 00:05:03
    here you can click through you can also
  • 00:05:05
    see the physical location of data in the
  • 00:05:07
    environment
  • 00:05:08
    the first thing we're going to talk
  • 00:05:09
    about here in the demo is the content
  • 00:05:12
    coverage so as i mentioned big id is
  • 00:05:15
    able to find all of your data sources
  • 00:05:17
    and types so it could be um structured
  • 00:05:20
    or unstructured data it could be on the
  • 00:05:21
    cloud on-prem wherever it is big id can
  • 00:05:24
    connect to it and add it to the catalog
  • 00:05:27
    with automated scanning and discovery so
  • 00:05:31
    um you can see here in our data
  • 00:05:32
    environment we have a whole selection of
  • 00:05:34
    a variety of data sources and types and
  • 00:05:38
    i can also very easily add a new data
  • 00:05:40
    source
  • 00:05:41
    here are a selection of um
  • 00:05:44
    pre-configured connectors to make it
  • 00:05:46
    super easy to connect your data and as
  • 00:05:47
    you can see we have a whole variety of
  • 00:05:50
    different data sources and types on-prem
  • 00:05:53
    cloud structured unstructured data it
  • 00:05:55
    all exists with a very easy to use
  • 00:05:57
    connection to import all of your data
  • 00:05:59
    into the catalog for a single view of
  • 00:06:02
    your data environment
  • 00:06:04
    next up we're talking about connectors
  • 00:06:06
    profiling and lineage so big id has
  • 00:06:09
    native functionality to visualize how
  • 00:06:11
    data is connected i'm sure you're all
  • 00:06:13
    aware that as far as lineage there are
  • 00:06:16
    your lineage providers that have very
  • 00:06:20
    sophisticated lineage tools and big id
  • 00:06:23
    is able to ingest that lineage content
  • 00:06:26
    and then add some um
  • 00:06:28
    add context to lineage so now you can
  • 00:06:30
    see that lineage with context in big id
  • 00:06:34
    another way you can think about lineage
  • 00:06:35
    here is with this data processing and
  • 00:06:37
    sharing here's just an example of how we
  • 00:06:40
    share some lineage and big id about just
  • 00:06:43
    how data flows so for example these are
  • 00:06:45
    all processes where we know that data is
  • 00:06:47
    moving you know from place to place
  • 00:06:48
    within an organization and i'm such a
  • 00:06:50
    data and map of how data is flowing
  • 00:06:53
    through an organization it's just
  • 00:06:54
    another kind of view of lineage in terms
  • 00:06:57
    of processing and data flows
  • 00:07:01
    for profiling the way that we do data
  • 00:07:03
    profiling is i can use the catalog
  • 00:07:08
    and i can see what kind of um you know
  • 00:07:10
    what my data looks like so for example
  • 00:07:14
    um let's look here at these attributes
  • 00:07:17
    name so i want to look at everything
  • 00:07:19
    that big id has determined has the
  • 00:07:21
    attribute of name and you can see here i
  • 00:07:23
    have a whole variety of different data
  • 00:07:25
    sources where name came up which is not
  • 00:07:27
    surprising so if i look at this for
  • 00:07:29
    example marketing
  • 00:07:32
    i can see that right away if i'm looking
  • 00:07:34
    for um
  • 00:07:35
    you have some profiling information just
  • 00:07:37
    to know about this data i can see right
  • 00:07:39
    away this marketing contains pi that
  • 00:07:41
    means it contains some sensitive
  • 00:07:42
    information i can see it was tagged as
  • 00:07:44
    high risk i can see more information
  • 00:07:46
    that was basically tagged
  • 00:07:48
    directly from big id with automated
  • 00:07:50
    scanning and context i can see the
  • 00:07:53
    attributes so then this marketing data
  • 00:07:56
    object i can see it has these different
  • 00:07:57
    attributes connected with
  • 00:07:59
    different classifiers for countries and
  • 00:08:01
    addresses and names which is also had to
  • 00:08:04
    be expected in a marketing database
  • 00:08:06
    i can see different columns that are
  • 00:08:09
    included in this data set so city and
  • 00:08:12
    country i can see what type of
  • 00:08:14
    characters i can see the attributes and
  • 00:08:16
    tags that they are applied here
  • 00:08:19
    and then i can see even a preview of the
  • 00:08:21
    data to get an idea about what the data
  • 00:08:23
    looks like to determine if it's what i'm
  • 00:08:24
    really looking for in my data and what
  • 00:08:27
    um what the true data
  • 00:08:29
    example is i can also interact with the
  • 00:08:31
    data another way if i see that there's
  • 00:08:33
    something here that doesn't look quite
  • 00:08:34
    right to me i can collaborate with the
  • 00:08:36
    data owners and i can report an issue
  • 00:08:38
    about it to get resolved or i can set
  • 00:08:41
    myself here as a watcher of the data to
  • 00:08:42
    get alerts for everything changes with
  • 00:08:45
    this data set that i am interested in
  • 00:08:48
    for data curation here we're going to
  • 00:08:50
    add context data the way that big id
  • 00:08:52
    does that is kind of our core function
  • 00:08:55
    right so we have correlation
  • 00:08:57
    um and in correlation we're going to add
  • 00:08:59
    context by seeing
  • 00:09:01
    where the
  • 00:09:02
    where data is um
  • 00:09:05
    is related to other data in our
  • 00:09:07
    environment so for example here in our
  • 00:09:09
    email suggestion we have
  • 00:09:12
    email here medical and i can see that it
  • 00:09:15
    is here in our patient information and
  • 00:09:17
    in our notes information this is letting
  • 00:09:19
    me know that the this email attribute is
  • 00:09:23
    related to these different um
  • 00:09:26
    these different data sources as well and
  • 00:09:28
    that i can click through to more of that
  • 00:09:30
    too it's all interactive
  • 00:09:32
    the way that we're going to interact
  • 00:09:33
    with our data is here in the cluster
  • 00:09:36
    analysis so i can get more curation i
  • 00:09:38
    can get more context from my data here
  • 00:09:40
    and cluster analysis by being able to
  • 00:09:42
    see what data is duplicate so for
  • 00:09:45
    example in this employee information
  • 00:09:47
    this is all groups of similar data i can
  • 00:09:50
    see based on how big the bubble is and
  • 00:09:52
    how far away it is from other bubbles
  • 00:09:54
    how closely related it is to other data
  • 00:09:57
    types and sources and then if i want to
  • 00:09:58
    click into my cluster analysis i can
  • 00:10:01
    look into the customer to click into the
  • 00:10:03
    cluster and i can see um
  • 00:10:05
    you know some let me see queries i can
  • 00:10:07
    see top keywords attributes i can see
  • 00:10:10
    objects for data source i can get more
  • 00:10:11
    information about the things that live
  • 00:10:13
    in that in that bubble things that are
  • 00:10:16
    similar information which is important
  • 00:10:17
    to know if i'm looking for something to
  • 00:10:19
    know if i have duplicate data so i know
  • 00:10:21
    that i'm using the right one or if i'm
  • 00:10:23
    doing a cleanup or if i'm doing a cloud
  • 00:10:25
    migration for example to know what data
  • 00:10:26
    to to migrate to know what is in my
  • 00:10:28
    environment and what i need to to know
  • 00:10:31
    about it
  • 00:10:32
    i want to show you also classification
  • 00:10:35
    big id is going to analyze your data not
  • 00:10:37
    just from the metadata but the actual
  • 00:10:39
    physical data we're going to analyze
  • 00:10:41
    that and determine what it is and then
  • 00:10:43
    we're going to add classifier so you can
  • 00:10:44
    use a whole selection of big id out of
  • 00:10:47
    the box classifications classifiers or
  • 00:10:49
    you can set your own custom classifiers
  • 00:10:52
    based on what is important
  • 00:10:54
    in your organization
  • 00:10:58
    we're going to talk next about the
  • 00:11:00
    search functionality once we have all of
  • 00:11:02
    this data and it's all been
  • 00:11:04
    um reported into it's all in found a big
  • 00:11:07
    id
  • 00:11:08
    i can search here in the catalog
  • 00:11:10
    and find that data that i'm looking for
  • 00:11:12
    i can either search using these quick
  • 00:11:14
    filters here at this side or i can
  • 00:11:16
    search using a quick
  • 00:11:18
    keyword here i can go back and see my
  • 00:11:21
    recently viewed object so i want to
  • 00:11:22
    continue working on a project maybe i
  • 00:11:24
    was working on earlier or i can type in
  • 00:11:27
    a keyword and see and look for some
  • 00:11:29
    account as my keyword i can see policies
  • 00:11:32
    and policies tables files to see
  • 00:11:35
    all of the different
  • 00:11:37
    different data objects or policies or
  • 00:11:40
    data um the objects that exist in big id
  • 00:11:43
    related to my keyword search
  • 00:11:47
    for data access
  • 00:11:50
    again if we go back to the catalog um
  • 00:11:54
    and we look back at any of that
  • 00:11:57
    information so for example you're back
  • 00:11:59
    in that marketing
  • 00:12:02
    we're looking at earlier today i can go
  • 00:12:04
    here to marketing and i can x at any
  • 00:12:07
    point you'll see this export is here and
  • 00:12:08
    i can export data to um to have a report
  • 00:12:11
    of the information i'm looking at so if
  • 00:12:13
    i'm looking at the os of columns i can
  • 00:12:15
    export and i can see that to access to
  • 00:12:18
    to use it to integrate with other
  • 00:12:20
    solutions as well
  • 00:12:22
    next on our list we talk about privacy
  • 00:12:23
    controls so we have privacy
  • 00:12:26
    notifications just like here contains pi
  • 00:12:28
    that just means it contains some
  • 00:12:30
    sensitive information depending on how
  • 00:12:32
    you define that um privacy controls
  • 00:12:34
    means that i can see what data is
  • 00:12:36
    sensitive so i can determine how to
  • 00:12:38
    protect it who should have access to it
  • 00:12:40
    for example um we can do
  • 00:12:43
    things like connecting externally to
  • 00:12:45
    snowflake and in snowflake we have a way
  • 00:12:48
    to say this data was determined to be
  • 00:12:50
    sensitive in big id which means that in
  • 00:12:52
    snowflake you may need to make sure
  • 00:12:54
    we can set some role based access
  • 00:12:57
    control and make sure that
  • 00:12:59
    only people who should see the data are
  • 00:13:01
    able to see it and everyone else if
  • 00:13:03
    you're asked to see that data you get
  • 00:13:04
    masking so you get masked data and it
  • 00:13:07
    happens dynamically so as um imagine
  • 00:13:10
    this data is flowing into your snowflake
  • 00:13:11
    environment being able to determine what
  • 00:13:13
    is sensitive who to have access and then
  • 00:13:15
    having that all enabled automatically
  • 00:13:17
    which is great because it means you're
  • 00:13:19
    going to enable people who should have
  • 00:13:20
    access to do very fast access for
  • 00:13:23
    analytics and you're going to
  • 00:13:25
    make sure your data is protected people
  • 00:13:27
    who don't need to have access shouldn't
  • 00:13:28
    see it
  • 00:13:30
    you also have access controls within big
  • 00:13:32
    id so under administration
  • 00:13:34
    you can see here we have access
  • 00:13:36
    management and that means we're able to
  • 00:13:38
    add different users as we add users we
  • 00:13:41
    can also assign them a role and then we
  • 00:13:43
    can go to their roles and determine
  • 00:13:45
    based on their role
  • 00:13:47
    we can define
  • 00:13:48
    at a very granular level what that user
  • 00:13:51
    is able to do or not do it allows us to
  • 00:13:54
    kind of give groups of users similar
  • 00:13:56
    roles and similar access to have similar
  • 00:13:59
    rights within the system for the big id
  • 00:14:02
    application itself
  • 00:14:05
    finally pricing and support
  • 00:14:07
    so on support um big id offers
  • 00:14:09
    award-winning customer support we have
  • 00:14:11
    staff around the globe we offer
  • 00:14:13
    continuous coverage for our customers as
  • 00:14:16
    far as pricing pricing is going to be
  • 00:14:18
    based on bands of data sources with apps
  • 00:14:21
    so
  • 00:14:22
    packages are really customized based on
  • 00:14:24
    your needs um you can
  • 00:14:26
    have more or less data connected and you
  • 00:14:29
    can have more or less applications so
  • 00:14:31
    these are all the different kinds of
  • 00:14:32
    applications that you can choose from
  • 00:14:34
    and you can also browse the marketplace
  • 00:14:36
    to find more applications that you might
  • 00:14:38
    want to add to take action on your data
  • 00:14:40
    in big id to learn more about any of
  • 00:14:42
    these functions i recommend that you
  • 00:14:44
    contact us at bigid.com
  • 00:14:46
    and in summary i want to leave you with
  • 00:14:48
    the idea that big id is the platform to
  • 00:14:51
    know your data it's based on a core
  • 00:14:52
    discovery we can connect to all your
  • 00:14:54
    data sources structured unstructured
  • 00:14:56
    on-prem in the cloud doesn't matter
  • 00:14:58
    where it is or what it is we can connect
  • 00:15:00
    it all in big id you can see it all in
  • 00:15:02
    the catalog and this is really
  • 00:15:04
    significant because if you imagine
  • 00:15:06
    trying to do all of this manually
  • 00:15:07
    discovering your data attacking it i'm
  • 00:15:09
    applying policies being able to
  • 00:15:11
    determine what is sensitive if you try
  • 00:15:12
    to do it manually it's almost impossible
  • 00:15:15
    with the rate of data growth and change
  • 00:15:17
    that we're seeing so that's why everyday
  • 00:15:19
    that's why big id is reimagining data
  • 00:15:22
    governance with a data catalog that
  • 00:15:23
    connects to all our data sources we
  • 00:15:25
    deliver automated scanning and we add
  • 00:15:27
    context at scale to know your data
  • 00:15:30
    have any more questions we'd be happy to
  • 00:15:32
    talk with you please contact us or check
  • 00:15:34
    out bigid.com for more information and
  • 00:15:37
    thanks for watching
  • 00:15:40
    you
标签
  • BigID
  • data intelligence
  • data governance
  • machine learning
  • data catalog
  • privacy compliance
  • data management
  • automated scanning
  • data classification
  • risk minimization