Skip to content

Commit e46c088

Browse files
authored
[ML-5665] Update to Spark 2.4 and TF 1.12 (#160)
1 parent 4095ea7 commit e46c088

File tree

6 files changed

+21
-16
lines changed

6 files changed

+21
-16
lines changed

.travis.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -18,11 +18,11 @@ cache:
1818

1919
env:
2020
matrix:
21-
- SCALA_BINARY_VERSION=2.11.8 SPARK_VERSION=2.3.1 SPARK_BUILD="spark-2.3.1-bin-hadoop2.7"
22-
SPARK_BUILD_URL="https://dist.apache.org/repos/dist/release/spark/spark-2.3.1/spark-2.3.1-bin-hadoop2.7.tgz"
21+
- SCALA_BINARY_VERSION=2.11.8 SPARK_VERSION=2.4.0 SPARK_BUILD="spark-2.4.0-bin-hadoop2.7"
22+
SPARK_BUILD_URL="https://dist.apache.org/repos/dist/release/spark/spark-2.4.0/spark-2.4.0-bin-hadoop2.7.tgz"
2323
PYTHON_VERSION=2.7.13
24-
- SCALA_BINARY_VERSION=2.11.8 SPARK_VERSION=2.3.1 SPARK_BUILD="spark-2.3.1-bin-hadoop2.7"
25-
SPARK_BUILD_URL="https://dist.apache.org/repos/dist/release/spark/spark-2.3.1/spark-2.3.1-bin-hadoop2.7.tgz"
24+
- SCALA_BINARY_VERSION=2.11.8 SPARK_VERSION=2.4.0 SPARK_BUILD="spark-2.4.0-bin-hadoop2.7"
25+
SPARK_BUILD_URL="https://dist.apache.org/repos/dist/release/spark/spark-2.4.0/spark-2.4.0-bin-hadoop2.7.tgz"
2626
PYTHON_VERSION=3.6.2
2727

2828
before_install:

README.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ Supported platforms:
1818
1919
See the file `project/Dependencies.scala` for adding your own platform.
2020

21-
Officially TensorFrames supports Spark 2.3+ and Scala 2.11.
21+
Officially TensorFrames supports Spark 2.4+ and Scala 2.11.
2222

2323
See the [user guide](https://github.com/databricks/tensorframes/wiki/TensorFrames-user-guide) for
2424
extensive information about the API.
@@ -30,11 +30,11 @@ TensorFrames is available as a
3030

3131
## Requirements
3232

33-
- A working version of Apache Spark (2.3 or greater)
33+
- A working version of Apache Spark (2.4 or greater)
3434

3535
- Java 8+
3636

37-
- (Optional) python 2.7+/3.4+ if you want to use the python interface.
37+
- (Optional) python 2.7+/3.6+ if you want to use the python interface.
3838

3939
- (Optional) the python TensorFlow package if you want to use the python interface. See the
4040
[official instructions](https://www.tensorflow.org/install/)
@@ -54,7 +54,7 @@ Additionally, for developement, you need the following dependencies:
5454
Assuming that `SPARK_HOME` is set, you can use PySpark like any other Spark package.
5555

5656
```bash
57-
$SPARK_HOME/bin/pyspark --packages databricks:tensorframes:0.5.0-s_2.11
57+
$SPARK_HOME/bin/pyspark --packages databricks:tensorframes:0.6.0-s_2.11
5858
```
5959

6060
Here is a small program that uses TensorFlow to add 3 to an existing column.
@@ -151,7 +151,7 @@ The scala support is a bit more limited than python. In scala, operations can be
151151
You simply use the published package:
152152

153153
```bash
154-
$SPARK_HOME/bin/spark-shell --packages databricks:tensorframes:0.5.0-s_2.11
154+
$SPARK_HOME/bin/spark-shell --packages databricks:tensorframes:0.6.0-s_2.11
155155
```
156156

157157
Here is the same program as before:
@@ -204,14 +204,14 @@ build/sbt distribution/spDist
204204
Assuming that SPARK_HOME is set and that you are in the root directory of the project:
205205

206206
```bash
207-
$SPARK_HOME/bin/spark-shell --jars $PWD/target/testing/scala-2.11/tensorframes-assembly-0.5.1-SNAPSHOT.jar
207+
$SPARK_HOME/bin/spark-shell --jars $PWD/target/testing/scala-2.11/tensorframes-assembly-0.6.1-SNAPSHOT.jar
208208
```
209209

210210
If you want to run the python version:
211211

212212
```bash
213-
PYTHONPATH=$PWD/target/testing/scala-2.11/tensorframes-assembly-0.5.1-SNAPSHOT.jar \
214-
$SPARK_HOME/bin/pyspark --jars $PWD/target/testing/scala-2.11/tensorframes-assembly-0.5.1-SNAPSHOT.jar
213+
PYTHONPATH=$PWD/target/testing/scala-2.11/tensorframes-assembly-0.6.1-SNAPSHOT.jar \
214+
$SPARK_HOME/bin/pyspark --jars $PWD/target/testing/scala-2.11/tensorframes-assembly-0.6.1-SNAPSHOT.jar
215215
```
216216

217217
## Acknowledgements

project/Dependencies.scala

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
object Dependencies {
22
// The spark version
3-
val targetSparkVersion = sys.props.getOrElse("spark.version", "2.3.1")
4-
val targetTensorFlowVersion = "1.10.0"
3+
val targetSparkVersion = sys.props.getOrElse("spark.version", "2.4.0")
4+
val targetTensorFlowVersion = "1.12.0"
55
}

python/requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,4 +3,4 @@ nose>=1.3.3
33
pandas>=0.19.1
44
# The proto files under src/main/protobuf must be in sync with the TF version here.
55
# You can use update-tf-proto.sh under dev/ to update the files.
6-
tensorflow==1.10.0
6+
tensorflow==1.12.0

src/main/protobuf/tensorflow/core/framework/step_stats.proto

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -67,6 +67,11 @@ message NodeExecStats {
6767
uint32 thread_id = 10;
6868
repeated AllocationDescription referenced_tensor = 11;
6969
MemoryStats memory_stats = 12;
70+
int64 all_start_nanos = 13;
71+
int64 op_start_rel_nanos = 14;
72+
int64 op_end_rel_nanos = 15;
73+
int64 all_end_rel_nanos = 16;
74+
int64 scheduled_nanos = 17;
7075
};
7176

7277
message DeviceStepStats {

version.sbt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
version in ThisBuild := "0.5.1-SNAPSHOT"
1+
version in ThisBuild := "0.6.0-SNAPSHOT"

0 commit comments

Comments
 (0)