Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
45 changes: 45 additions & 0 deletions BUILDING.txt
Original file line number Diff line number Diff line change
Expand Up @@ -388,6 +388,51 @@ Create a local staging version of the website (in /tmp/hadoop-site)

Note that the site needs to be built in a second pass after other artifacts.

----------------------------------------------------------------------------------
Including Cloud Connector Dependencies in Distributions:

Hadoop distributions include the hadoop modules to work with data and services
on cloud infrastructure

However, dependencies are omitted for all cloud connectors except hadoop-azure
(abfs:// and wasb://) and possibly hadoop-gcp (gs://) and hadoop-tos (tos://).
For the latter two modules, it depends on shading options.

For hadoop-aws the AWS SDK bundle.jar omitted, but everything else is included.

* This keeps binary release size below the limit of apache distributions
* Reduces download and size overhead in docker usage.
* Reduces the CVE attack surface
* Reduces the risk of classpath conflict.

To produce a build with the specific desired dependencies, the build must be executed
with the relevant profile of ${module}-package.

For example, a build with the hadoop-aws and hadoop-azure-datalake dependencies,
run with

mvn package -Pdist -DskipTests -Dhadoop-aws-package -Dhadoop-azure-datalake-package

Available package profiles:
hadoop-aliyun-package
hadoop-aws-package
hadoop-azure-datalake-package
hadoop-cos-package
hadoop-gcp-package
hadoop-huaweicloud-package
hadoop-tos-package

To build a complete distribution then with all cloud dependencies included:

mvn package -Pdist -DskipTests -Dtar -Dmaven.javadoc.skip=true \
-Phadoop-aliyun-package,hadoop-aws-package,hadoop-azure-datalake-package\
-Phadoop-cos-package,hadoop-gcp-package,hadoop-tos-package

The resulting tar file will be too large to be distributable through ASF infrastructure.

The hadoop-gcp and hadoop-tos artifacts include their dependencies unless the distribution
is built with -DskipShade.

----------------------------------------------------------------------------------
Installing Hadoop

Expand Down
28 changes: 25 additions & 3 deletions LICENSE-binary
Original file line number Diff line number Diff line change
Expand Up @@ -203,18 +203,23 @@

--------------------------------------------------------------------------------
This project bundles some components that are also licensed under the Apache
License Version 2.0:
License Version 2.0.
Note: some of the listed artifacts may not be included in a given build of the binary
distribution; it depends on the build options. This list intends
to be inclusive of all which may be included:


hadoop-hdfs-project/hadoop-hdfs/src/main/webapps/static/nvd3-1.8.5.* (css and js files)
hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/checker/AbstractFuture.java
hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/checker/TimeoutFuture.java

ch.qos.reload4j:reload4j:1.2.22
com.aliyun:aliyun-java-core:0.2.11-beta
com.aliyun:aliyun-java-sdk-core:4.5.10
com.aliyun:aliyun-java-sdk-kms:2.11.0
com.aliyun:aliyun-java-sdk-ram:3.1.0
com.aliyun:aliyun-java-sdk-sts:3.0.0
com.aliyun:java-trace-api:0.2.11-beta
com.aliyun.oss:aliyun-sdk-oss:3.13.2
com.cedarsoftware:java-util:1.9.0
com.cedarsoftware:json-io:2.5.1
Expand Down Expand Up @@ -268,8 +273,13 @@ com.google.http-client:google-http-client-jackson2:1.46.3
com.google.http-client:google-http-client:1.46.3
com.google.j2objc:j2objc-annotations:3.0.0
com.google.oauth-client:google-oauth-client:1.37.0
com.microsoft.azure:azure-storage:7.0.0
com.huaweicloud:esdk-obs-java:3.20.4.2
com.jamesmurty.utils:java-xmlbuilder-1.2.jar
com.microsoft.azure:azure-storage:7.0.1
com.nimbusds:nimbus-jose-jwt:10.4
com.squareup.okhttp3:okhttp:jar:3.14.2
com.squareup.okio:okio:jar:1.17.2
com.volcengine:ve-tos-java-sdk-hadoop:2.8.9.jar
com.zaxxer:HikariCP:4.0.3
commons-beanutils:commons-beanutils:1.9.4
commons-cli:commons-cli:1.9.0
Expand Down Expand Up @@ -346,6 +356,9 @@ io.opentelemetry:opentelemetry-sdk-logs:1.47.0
io.opentelemetry:opentelemetry-sdk-metrics:1.47.0
io.opentelemetry:opentelemetry-sdk-trace:1.47.0
io.opentelemetry.semconv:opentelemetry-semconv:1.29.0-alpha
io.opentracing:opentracing-api:0.33.0.jar
io.opentracing:opentracing-noop:0.33.0.jar
io.opentracing:opentracing-util:0.33.0.jar
io.reactivex:rxjava:1.3.8
io.reactivex:rxjava-string:1.1.1
io.reactivex:rxnetty:0.4.20
Expand Down Expand Up @@ -373,6 +386,8 @@ org.apache.htrace:htrace-core:3.1.0-incubating
org.apache.htrace:htrace-core4:4.1.0-incubating
org.apache.httpcomponents:httpclient:4.5.13
org.apache.httpcomponents:httpcore:4.4.13
org.apache.httpcomponents.client5:httpclient5:5.5
org.apache.httpcomponents.core5:httpcore5:5.5
org.apache.kafka:kafka-clients:3.9.0
org.apache.kerby:kerb-admin:2.0.3
org.apache.kerby:kerb-client:2.0.3
Expand Down Expand Up @@ -496,6 +511,7 @@ hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanage
bootstrap v3.3.6
broccoli-asset-rev v2.4.2
broccoli-funnel v1.0.1
cos_api-bundle-5.6.19.jar
datatables v1.11.5
em-helpers v0.5.13
em-table v0.1.6
Expand Down Expand Up @@ -541,7 +557,7 @@ com.microsoft.azure:azure-cosmosdb:2.4.5
com.microsoft.azure:azure-cosmosdb-commons:2.4.5
com.microsoft.azure:azure-cosmosdb-direct:2.4.5
com.microsoft.azure:azure-cosmosdb-gateway:2.4.5
com.microsoft.azure:azure-data-lake-store-sdk:2.3.3
com.microsoft.azure:azure-data-lake-store-sdk:2.3.9
com.microsoft.azure:azure-keyvault-core:1.0.0
com.microsoft.sqlserver:mssql-jdbc:6.2.1.jre7
org.bouncycastle:bcpkix-jdk18on:1.78.1
Expand All @@ -552,6 +568,7 @@ org.codehaus.mojo:animal-sniffer-annotations:1.24
org.jruby.jcodings:jcodings:1.0.13
org.jruby.joni:joni:2.1.2
org.ojalgo:ojalgo:43.0
org.reactivestreams:reactive-streams:1.0.3.jar
org.slf4j:jul-to-slf4j:1.7.36
org.slf4j:slf4j-api:1.7.36
org.slf4j:slf4j-reload4j:1.7.36
Expand Down Expand Up @@ -622,3 +639,8 @@ Public Domain
-------------

aopalliance:aopalliance:1.0

Dom4J license
-------------

org.dom4j:dom4j:2.1.4.jar
4 changes: 4 additions & 0 deletions dev-support/bin/dist-layout-stitching
Original file line number Diff line number Diff line change
Expand Up @@ -130,6 +130,10 @@ run cp -p "${ROOT}/README.txt" .
run copy "${ROOT}/hadoop-common-project/hadoop-common/target/hadoop-common-${VERSION}" .
run copy "${ROOT}/hadoop-common-project/hadoop-nfs/target/hadoop-nfs-${VERSION}" .
run copy "${ROOT}/hadoop-common-project/hadoop-registry/target/hadoop-registry-${VERSION}" .

# cloud connectors go into common
run copy "${ROOT}/hadoop-cloud-storage-project/hadoop-cloud-storage-dist/target/hadoop-cloud-storage-dist-${VERSION}" .

run copy "${ROOT}/hadoop-hdfs-project/hadoop-hdfs/target/hadoop-hdfs-${VERSION}" .
run copy "${ROOT}/hadoop-hdfs-project/hadoop-hdfs-nfs/target/hadoop-hdfs-nfs-${VERSION}" .
run copy "${ROOT}/hadoop-hdfs-project/hadoop-hdfs-client/target/hadoop-hdfs-client-${VERSION}" .
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,59 @@
<!--
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to You under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

https://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the Li2cense is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->
<assembly xmlns="http://maven.apache.org/plugins/maven-assembly-plugin/assembly/1.1.3"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/plugins/maven-assembly-plugin/assembly/1.1.3 https://maven.apache.org/xsd/assembly-1.1.3.xsd">
<id>hadoop-cloud-storage</id>
<formats>
<format>dir</format>
</formats>
<includeBaseDirectory>false</includeBaseDirectory>

<!--
This is executed in directory hadoop-cloud-storage-project/hadoop-cloud-storage-dist
All paths must be relative to that.
-->
<fileSets>
<fileSet>
<directory>../../hadoop-tools/hadoop-aws/src/main/bin</directory>
<outputDirectory>/bin</outputDirectory>
<fileMode>0755</fileMode>
</fileSet>
<fileSet>
<directory>./../hadoop-tools/hadoop-aws/src/main/shellprofile.d</directory>
<includes>
<include>*</include>
</includes>
<outputDirectory>/libexec/shellprofile.d</outputDirectory>
<fileMode>0755</fileMode>
</fileSet>
</fileSets>

<dependencySets>
<dependencySet>
<outputDirectory>/share/hadoop/common/lib</outputDirectory>
<unpack>false</unpack>
<scope>runtime</scope>
<useProjectArtifact>false</useProjectArtifact>
<!-- Stop some needless artifact propagation -->
<excludes>
<exclude>org.apache.hadoop:hadoop-annotations</exclude>
<exclude>org.apache.hadoop.thirdparty:hadoop-shaded-guava</exclude>
</excludes>
</dependencySet>
</dependencySets>
</assembly>
Original file line number Diff line number Diff line change
Expand Up @@ -57,6 +57,7 @@
<exclude>**/file:/**</exclude>
<exclude>**/SecurityAuth.audit*</exclude>
<exclude>patchprocess/**</exclude>
<exclude>**/auth-keys.xml</exclude>
</excludes>
</fileSet>
</fileSets>
Expand Down
Loading