-
Notifications
You must be signed in to change notification settings - Fork 27
remove config and checkpoint dir from default spark config #357
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
|
Hi Carsten @behrica, my apologies for the delayed reply. Anything I can do to help to make this work for you? Do you need me to deploy a new version to Clojars? |
|
My change makes it work for me, |
|
Hi Carsten, I just gave you a maintainer access to the repo. Please feel free to change it however you see fit 🙏 |
|
@behrica it looks like the CI/CD build for this PR failed, but it was a while ago so the logs are no longer accessible to check why it failed. If you have maintainer access, you should have permission to re-try the build in the actions tab? Let's fix the build and then the PR will be in a mergeable state. @anthony-khong if you're ok with this PR could you approve it? Thanks 🙏 |
|
Actually @anthony-khong I'd be interested to know the original rationale for specifying the default config thus: As in, what would be lost (or break, not work) by removing this? I can look up what these config options mean for sure but I was thinking about anything non-obvious for newcomers? As in, the purpose of having a default session is to get working quickly correct? |
|
This seems Its old setting failed for me on latest Azure Databricks, as it points "somehow" to a non-writable location. The ohter 2 seems to be "performabce setting", which probbaly cannot be set "correctly" for all situations neither. I am by far no expert in Spark, but "No options" seems to be the best default. But here we touch again the question of the "default session". |
|
The docu on getOrCreate is clear:
eventhough I find this "weired" In any case to me a "user" need full control on the "options" incl. "checkpoint dir" and this is to given with curret code, some are applied always, its seems. I cannot change the config "before it is used", I think.... see here as well: The "atom" was once a "delay", which would maybe avoid the problem, but still be brittle somehow related to "swap it out" for testing. |
fixes #356