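Author | Guo Wei. Over the past decade, the main thread of data engineering has been the Modern Data Stack's dismantling and reassembly of the traditional data warehouse. We split data collection out of the database to form Data Ingestion, using FiveTran, Airbyte, and Apache SeaTunnel to handle ELT / CDC / Reverse ETL; we split compute out of storage, which gave rise to Snowflake, Databricks, Iceberg, H ...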
Though the AI era conjures a futuristic, tech-advanced image of the present, AI fundamentally depends on the same data standards that have been around forever. These data standards—such as being clean ...
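The 42nd IEEE International Conference on Data Engineering (known in the industry as ICDE) was held from May 4 to 8, 2026, in Montreal, Canada. As a top global conference on data and computing technology, each ICDE brings together leading technology companies and top scholars and professors from around the world for a high-level contest, and publishes state-of-the-art results along with the most forward-looking technology trends. In an era of breakneck AI progress and noisy, cutthroat industry competition, there is always one kind of company: low-key ...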
Genesis Computing has been recognised in Gartner's "Data Engineering 2.0" research report (G00852814, April 2026) for agentic ...
The engineering team at Meta recently outlined how the company migrated a data ingestion platform that transfers several petabytes of MySQL social graph data daily to improve reliability and ...
KDNuggets, a community site for data professionals, ranked “We Don’t Need Data Scientists, We Need Data Engineers,” by Mihail Eric, a venture capitalist, researcher, and educator, as its top story of ...