什么是大数据生态系统What does Hadoop Ecosystem mean? Definition 1:
The Hadoop ecosystem refers to the various components of the Apache Hadoop software library, as well as to the accessories and tools provided by the Apache Software Foundation for these types of software projects, and to the ways that they work together. Definition 2:
Hadoop EcosyHstem is neither a programming language nor a service, it is a platform or framework which is extremely popular for handling and analyzing large sets of data.You can consider it as a suite which encompasses a number of services (ingesting, storing, analyzing and maintaining) inside it.
大数据生态系统习惯被称为Hadoop生态系统,这其中主要包含两个原因:
1. 由多个具备各自特点或功能的组件共同构成,并能够相互协作完成大数据处理任务
2. hadoop组件是其中最早出现也是最基础的组件 Hadoop 生态系统一览图hadoop生态圈中的组件的角色如图所示(关于各组件的详细介绍会在下一节给出),特别有趣的是,hadoop生态圈的组件大多具有动物或者昆虫类的图标,而作为管理者zookeeper即表示动物园管理员,即管理动物园生态圈。