{"id":1193,"date":"2020-12-09T09:55:59","date_gmt":"2020-12-09T09:55:59","guid":{"rendered":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/?p=1193"},"modified":"2021-05-19T06:17:08","modified_gmt":"2021-05-19T06:17:08","slug":"amazon-athena-sql-without-the-database","status":"publish","type":"post","link":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/","title":{"rendered":"Amazon Athena: SQL \u2013 Without the Database"},"content":{"rendered":"\n<p>Many companies face the problem that data may become important for new applications years later, but by then they have long been deleted, or their structure has changed several times. Furthermore, data have often been filtered, aggregated or transformed before they were first saved, so they are no longer complete when they are needed later.<\/p>\n\n\n\n<p>For data-intensive projects, particularly in data science or AI, suitable data must therefore be collected anew, which significantly delays the planned projects.<\/p>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">How can data lakes help?<\/h2>\n\n\n\n<p>A data lake is an architectural pattern that aims to make data from various applications available in a centralized ecosystem over the long term. Where possible, data from every segment and department of a company are stored in a central location. Unlike in traditional data warehouses, however, the raw data are always stored as well, often in an object storage system such as S3.<\/p>\n\n\n\n<p>The advantage of this approach is that the information is available in its entirety, without being reduced or transformed when it is first stored, as it would be in a traditional data warehouse. Consequently, the central data pool does not have a structure that is tailored to specific user requirements, i.e. 
in this case, the consumers have to deduce the meaning of the data themselves.<\/p>\n\n\n\n<p>To exploit the advantages of a data lake efficiently, it should be provided across departments. This way, the data can be retrieved wherever they are needed.<\/p>\n\n\n\n<p>The data can be stored in different zones, which allows access at different levels of abstraction. Data scientists, for example, use low-level tools such as Athena to gain in-depth, detailed insight into the data pool, whereas more specialized data marts are preferable for technical departments.<\/p>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">What does Amazon Athena offer?<\/h2>\n\n\n\n<p>Amazon Athena allows SQL queries to be executed directly on (semi-)structured data in S3 buckets, without the need for a database with a defined structure. Preparatory ETL (Extract, Transform, Load) processes, as we know them from traditional data warehouses, are not required for working with the raw data either.<\/p>\n\n\n\n<p>As Amazon Athena is a serverless service, users do not have to provide any infrastructure. Provisioning happens automatically in the background and is transparent for the user. 
On the one hand, this reduces the effort and specialist knowledge required; on the other hand, costs are incurred only per gigabyte of data read from S3.<\/p>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Lecture at online campus event (German only)<\/h2>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<iframe loading=\"lazy\" src=\"https:\/\/www.facebook.com\/plugins\/video.php?href=https%3A%2F%2Fwww.facebook.com%2FZEISSDigitalInnovation%2Fvideos%2F214145523405097%2F&#038;show_text=0&#038;width=560\" width=\"600\" height=\"338\" style=\"border:none;overflow:hidden\" scrolling=\"no\" frameborder=\"0\" allowfullscreen=\"true\" allow=\"autoplay; clipboard-write; encrypted-media; picture-in-picture; web-share\" allowFullScreen=\"true\"><\/iframe>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>The video above, from our <a href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/knowledge-transfer-and-team-building-during-the-coronavirus-pandemic\/\">first online campus event<\/a>, gives more detailed insight into the technical background and the possibilities for application and optimization. It includes a discussion of practical experience and a brief live demonstration in the AWS console.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several times. 
<\/p>\n","protected":false},"author":106,"featured_media":1194,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"advgb_blocks_editor_width":"","advgb_blocks_columns_visual_guide":"","footnotes":""},"categories":[9],"tags":[203,616,617,618,619],"topics":[591],"class_list":["post-1193","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cloud","tag-editorschoice","tag-datalakes","tag-data-lakes","tag-aws-athena","tag-serverless","topics-online-campus"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.0 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Amazon Athena: SQL \u2013 Without the Database - ZEISS Digital Innovation Blog<\/title>\n<meta name=\"description\" content=\"Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several times.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Amazon Athena: SQL \u2013 Without the Database - ZEISS Digital Innovation Blog\" \/>\n<meta property=\"og:description\" content=\"Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several times.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/\" \/>\n<meta property=\"og:site_name\" content=\"Digital Innovation Blog\" 
\/>\n<meta property=\"article:published_time\" content=\"2020-12-09T09:55:59+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-05-19T06:17:08+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1920\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Michael Muck\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Michael Muck\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/\",\"url\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/\",\"name\":\"Amazon Athena: SQL \u2013 Without the Database - ZEISS Digital Innovation 
Blog\",\"isPartOf\":{\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg\",\"datePublished\":\"2020-12-09T09:55:59+00:00\",\"dateModified\":\"2021-05-19T06:17:08+00:00\",\"author\":{\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#\/schema\/person\/4dd63ad26fa1018f5327e9310b89e188\"},\"description\":\"Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several times.\",\"breadcrumb\":{\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#primaryimage\",\"url\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg\",\"contentUrl\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg\",\"width\":2560,\"height\":1920},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"htt
ps:\/\/blogs.zeiss.com\/digital-innovation\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Amazon Athena: SQL \u2013 Without the Database\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#website\",\"url\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/\",\"name\":\"Digital Innovation Blog\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#\/schema\/person\/4dd63ad26fa1018f5327e9310b89e188\",\"name\":\"Michael Muck\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2024\/06\/Muck_Michael_Profilbild_300x300px-150x150.jpg\",\"contentUrl\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2024\/06\/Muck_Michael_Profilbild_300x300px-150x150.jpg\",\"caption\":\"Michael Muck\"},\"description\":\"Michael Muck holds a master's degree in computer science and works as software architect and group leader at ZEISS Digital Innovation in Munich. His focus is on cloud technologies. He has project experience in various industries, including retail, logistics and the automotive industry.\",\"url\":\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/author\/enmichaelmuck\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Amazon Athena: SQL \u2013 Without the Database - ZEISS Digital Innovation Blog","description":"Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several times.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/","og_locale":"en_US","og_type":"article","og_title":"Amazon Athena: SQL \u2013 Without the Database - ZEISS Digital Innovation Blog","og_description":"Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several times.","og_url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/","og_site_name":"Digital Innovation Blog","article_published_time":"2020-12-09T09:55:59+00:00","article_modified_time":"2021-05-19T06:17:08+00:00","og_image":[{"width":2560,"height":1920,"url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg","type":"image\/jpeg"}],"author":"Michael Muck","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Michael Muck","Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/","url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/","name":"Amazon Athena: SQL \u2013 Without the Database - ZEISS Digital Innovation Blog","isPartOf":{"@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#primaryimage"},"image":{"@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#primaryimage"},"thumbnailUrl":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg","datePublished":"2020-12-09T09:55:59+00:00","dateModified":"2021-05-19T06:17:08+00:00","author":{"@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#\/schema\/person\/4dd63ad26fa1018f5327e9310b89e188"},"description":"Many companies face the problem that data may be important for new applications years later, but when that time comes, they have long been deleted, or their structure has since been changed several 
times.","breadcrumb":{"@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#primaryimage","url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg","contentUrl":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-scaled.jpg","width":2560,"height":1920},{"@type":"BreadcrumbList","@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/amazon-athena-sql-without-the-database\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/"},{"@type":"ListItem","position":2,"name":"Amazon Athena: SQL \u2013 Without the Database"}]},{"@type":"WebSite","@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#website","url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/","name":"Digital Innovation Blog","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#\/schema\/person\/4dd63ad26fa1018f5327e9310b89e188","name":"Michael 
Muck","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/#\/schema\/person\/image\/","url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2024\/06\/Muck_Michael_Profilbild_300x300px-150x150.jpg","contentUrl":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2024\/06\/Muck_Michael_Profilbild_300x300px-150x150.jpg","caption":"Michael Muck"},"description":"Michael Muck holds a master's degree in computer science and works as software architect and group leader at ZEISS Digital Innovation in Munich. His focus is on cloud technologies. He has project experience in various industries, including retail, logistics and the automotive industry.","url":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/author\/enmichaelmuck\/"}]}},"author_meta":{"display_name":"Michael Muck","author_link":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/author\/enmichaelmuck\/"},"featured_img":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-content\/uploads\/sites\/3\/2020\/12\/202012_AWS_Athena_titel-600x450.jpg","coauthors":[],"tax_additional":{"categories":{"linked":["<a href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/category\/cloud\/\" class=\"advgb-post-tax-term\">Cloud<\/a>"],"unlinked":["<span class=\"advgb-post-tax-term\">Cloud<\/span>"]},"tags":{"linked":["<a href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/category\/cloud\/\" class=\"advgb-post-tax-term\">Editor&#039;s Choice<\/a>","<a href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/category\/cloud\/\" class=\"advgb-post-tax-term\">Datalakes<\/a>","<a href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/category\/cloud\/\" class=\"advgb-post-tax-term\">Data Lakes<\/a>","<a href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/category\/cloud\/\" class=\"advgb-post-tax-term\">AWS Athena<\/a>","<a 
href=\"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/category\/cloud\/\" class=\"advgb-post-tax-term\">Serverless<\/a>"],"unlinked":["<span class=\"advgb-post-tax-term\">Editor&#039;s Choice<\/span>","<span class=\"advgb-post-tax-term\">Datalakes<\/span>","<span class=\"advgb-post-tax-term\">Data Lakes<\/span>","<span class=\"advgb-post-tax-term\">AWS Athena<\/span>","<span class=\"advgb-post-tax-term\">Serverless<\/span>"]}},"comment_count":"0","relative_dates":{"created":"Posted 5 years ago","modified":"Updated 5 years ago"},"absolute_dates":{"created":"Posted on December 9, 2020","modified":"Updated on May 19, 2021"},"absolute_dates_time":{"created":"Posted on December 9, 2020 9:55 am","modified":"Updated on May 19, 2021 6:17 am"},"featured_img_caption":"","series_order":"","_links":{"self":[{"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/posts\/1193","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/users\/106"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/comments?post=1193"}],"version-history":[{"count":3,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/posts\/1193\/revisions"}],"predecessor-version":[{"id":1369,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/posts\/1193\/revisions\/1369"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/media\/1194"}],"wp:attachment":[{"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/media?parent=1193"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.zeiss.com\/digital-inn
ovation\/en\/wp-json\/wp\/v2\/categories?post=1193"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/tags?post=1193"},{"taxonomy":"topics","embeddable":true,"href":"https:\/\/blogs.zeiss.com\/digital-innovation\/en\/wp-json\/wp\/v2\/topics?post=1193"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}