APC 技術ブログ

株式会社エーピーコミュニケーションズの技術ブログです。

株式会社 エーピーコミュニケーションズの技術ブログです。

SIG の検索結果:

Why a Major Japanese Financial Institution Chose Databricks to Accelerate its Data and AI-Driven Journey

…rivacy: Designed with a focus on protecting your data Taking advantage of these features, we provide an environment in which companies can efficiently analyze data. Democratizing Data and AI with Databricks By using Databricks for Trusted D…

Scaling Deep Learning Using Delta Lake Storage Format on Databricks

…d cache design in AI training ​ AI training requires high-speed processing of large amounts of data. Therefore, optimizing data access patterns and cache design are important. Specifically, factors such as: ​ Data locality: By concentrating…

Increasing Trust in Your Data: Enabling a Data Governance Program on Databricks Using Unity Catalog and ML-Driven MDM Part 2/2

…ain the design of the data hub and data lakehouse built by Comcast and introduce the efforts to improve data reliability through data matching and machine learning solutions. ​ Let's dive in! www.ap-com.co.jp ​ Data Hub and Data Lakehouse D…

Maximizing Value From Your Data with Lakehouse AI

…so made a significant investment in the Databricks CLI to code the MLOps infrastructure and provide deep integration with the CICD pipeline. This allows ML engineering teams to focus on high-value features and reap the benefits of having da…

Extending Lakehouse Architecture with Collaborative Identity

…g these insights, companies will be able to use data more effectively and maximize business value. We will continue to deliver the latest information in the field of data and AI, so please look forward to it! Conclusion This content based o…

Databricks As Code: How to Effectively Automate a Secure Lakehouse Using Terraform for Resource Provisioning

…product design Fleet health monitoring Improved customer experience By utilizing these data, Rivian aims to improve business efficiency and competitiveness. Data integration with Databricks Lakehouse Rivian uses Databricks Lakehouse to aggr…

Building a Real-Time Model Monitoring Pipeline on Databricks

…use the insights gained through it to improve AI products and machine learning models were explained. Model monitoring is an essential element for maintaining and improving the performance of machine learning models. By considering various …

Databricks Connect Powered by Spark Connect: Develop and Debug Spark From Any Developer Tool

…t and its Significance ​ With the introduction of Spark Connect, the Spark architecture was broken down into a single client and server, with a properly designed protocol introduced between them. This resulted in the following benefits: ​ A…

How Rec Room Processes Billions of Events Per Day with Databricks and RudderStack

…Extract insights using data science to improve user gaming experiences Optimize in-game features and content based on insights ​ Incorporating the Latest Concepts, Features, and Services ​ Rec Room's data team is further improving data proc…

The Future Brought by the Democratization of Data and AI - Data + AI Summit Keynote Day 1

… provides signals such as popularity, frequent users, update time, and upstream quality issues. This makes it easier to search for data within an organization and improves the efficiency of data utilization. ​ During the keynote, a demonstr…

Labcorp Data Platform Journey: From Selection to Go-Live in Six Months

…atabricks significantly improved data processing speed compared to the traditional Hadoop. Seamless data integration: It became easier to integrate data between Databricks and other data sources, allowing for smoother data integration. Flex…

Ray on Apache Spark™ Part 1

… Ray is designed to be accessible to anyone without expertise in distributed systems. This has the following advantages: ​ Even if you are not a distributed system expert, you can easily implement distributed processing. System performance …

How Comcast Effectv Drives Data Observability with Databricks and Monte Carlo-01

…product designed to prevent data downtime and provide data observability. Data downtime is when data is unavailable or unreliable. This can have a negative impact on your business. Monte Carlo offers features such as: Monitor data quality a…

Using Lakehouse to Fight Cancer: Ontada’s Journey to Establish a RWD Platform on Databricks Lakehouse

…viously designed and implemented DataLeak for Veterans Affairs and CMS. The theme of this talk is the importance of real-world data and evidence in cancer care. The purpose of the session is to introduce the implementation of real-world dat…

Introduction to Data Engineering on the Lakehouse

…usiness insights. Specifically, the process is as follows. Data collection from IoT devices: Honeywell collects data from various IoT devices and stores it in DLT Conclusion This content based on reports from members on site participating i…

Summary of posts on DATA+AI Summit 2023 (June 28, 2023)

…Getting Insight From Anything: Gathering Data With IoT Devices and Delta Live Optimizing Batch and Streaming Aggregations The Future is Open: Data Streaming in an Omni-Cloud Reality Using Lakehouse to Fight Cancer: Ontada’s Journey to Estab…

DATA+AI Summit2023に関する投稿まとめ(2023年6月28日分)

…Getting Insight From Anything: Gathering Data With IoT Devices and Delta Live(インサイトを得る:IoTデバイスとデルタライブでデータを収集する) Optimizing Batch and Streaming Aggregations(バッチとストリーミング集約の最適化) The Future is Open: Data Streaming in an Omni-Cloud Reality Using…

What is Data Mesh Architecture?

…wledge, insight discovery, and intelligence. The purpose is not only to collect external data, but also to develop more valuable products by embedding data into existing processes and systems. (Jobs-To-Be-Done Theory) 6.3. Self-Serve Data P…

Building a chat bot with Dolly using dbdemos ~Data preparation~

… system design and prompt engineering using LLM. Dolly and its reasoning were explained in the previous article. techblog.ap-com.co.jp The notebook in the demo below is used as a reference and explained. Build your Chat Bot with Dolly table…

Databricks-06. [Databricks × dbt] Test for model

…nequality sign in the where clause. Preview and browse the table before test results. As you can see from the image, the number of records extracted is 0. Now run the test command to make sure the test succeeds. Test successful. Here is a s…

I passed the certification exam within 3 months of discovering Databricks.

…ry 2023 Assigned to the Lakehouse department in late January Started learning about Databrick using the materials of Databricks Academy, but struggled to acquire infrastructure-related knowledge without SQL queries and knowledge that differ…

Introduction to Fivetran(2) - Link Fivetran and Databricks and import data from Google Sheets

…tabricks. Sign in to Databricks Workspace and click the Partner Connect button on the bottom left of the screen. On the next Connect selection screen, select Fivetran. Click Next on the screen after transition. ※This will generate a Persona…

Databricks-05. [Databricks x dbt] Connect with dbt with Partner Connect

… use when signing up for dbt Cloud is displayed, but pre-filled with the email address associated with your workspace. Click the Connect to dbt Cloud button to initiate the connection. Once connected, a new tab of dbt cloud will open. This …

Setting up Databricks on AWS and creating a workspace

…Bricks 2. SIGN UP AND CHOOSE YOUR SUBSCRIPTION PLAN 3.Create a workspace 3-1. Credential configuration Create an IAM Role 3-2 Storage configuration Create S3 bucket Workspace provisioning Conclusion Introduction I joined the company in Janu…

Terraform v1.5.0からの新機能:Import Block機能の紹介と既存ツールとの比較

…g OpenPGP signature verification Archive: /tmp/tfenv_download.wIik8e/terraform_1.5.0-rc2_linux_amd64.zip inflating: /home/hoge/.tfenv/versions/1.5.0-rc2/terraform Installation of terraform v1.5.0-rc2 successful. To make this your default ve…

Backstageで独自Pluginを実装する

…ame> to assign annotations to primitive array items /** @items.visibility frontend */ myItems: string[]; }; } ポイントは各要素に @visibilityを指定することです。visibilityには次の3つがあります。 backend : 無指定の場合のデフォルト backend pluginでのみ読み取ることができます。 secret : backendで読み取ること…

【PlatformCon 2023】Platform teamの投資判断指標

…ractice insights from Wise on how to measure platform engineering impact 続いても同じようにPlatform engineeringのインパクトを測定する内容です。 platformcon.com 正しい領域に投資しているか、何を優先すべきか、Platform / Product(実際に外部顧客に提供しているサービス)のいずれに投資すべきかなど様々な判断をするためにも、それぞれのインパクトを把握することが…

【PlatformCon 2023】この事例はぜひ参考にしたい! How to enable platform engineering

… of the design それぞれのPlatform productとして、機能・非機能要件を定義しそれにあわせてエンジニアを配置してるとのことです。 FeedbackをQuickに得るように どんなプロダクトも完璧なプロダクトというものはありません。完璧を求めすぎると無駄な投資を生むことにも繋がります。不完全なものでも素早くリリースし、フィードバックを得ていくことが重要です。(このあたりは一般的なプロダクト開発でもまったく同じことが言われていると思います。Platfor…

【PlatformCon 2023】リファレンスアーキテクチャから見るPlatform Engineering

…latform design with reference architectures」 というセッションを取り上げたいと思います。 platformcon.com セッション概要 このセッションは、現代の複雑になりすぎた環境とそれに伴う認知負荷の向上に対する解決策を、具体的なリファレンスアーキテクチャを示しつつ解説するという内容です。 アプリケーションの実行基盤としてはAWS上のEKSを使うというシナリオでしたが、特徴的なポイントとしては Humanitec社 の Plat…

【PlatformCon 2023】Platform Engineeringの最終ゴールのイメージ

…latform designs with refernce architectures. 最初はこちら。Platform Engineeringで標準化をする意義とそれをセルフサービスで提供するイメージです。 ここではシステム全体を標準化しそれをリファレンスアーキテクチャとしているようです。 platformcon.com 【概要】 低い開発者体験は組織の技術スタックとそこにいる人々に以下のようなマイナスのインパクトを与えます。 低いプロダクティビティとイノベーションレベル …