Coherent Spark SDK

The Coherent Spark Python SDK is designed to elevate the developer experience and provide convenient access to Coherent Spark APIs.

👋 Just a heads-up: This SDK is supported by the community. If you encounter any bumps while using it, please report them by creating a new issue.

Installation

pip install cspark # or 'cspark[cli]' for CLI support.

This Python library requires Python 3.7+.

Usage

To use the SDK, you need a Coherent Spark account that lets you access the following:

  • Base URL (including the environment and tenant name)
  • User authentication (API key, bearer token, or OAuth2.0 client credentials details)
  • Spark service URI (to locate a specific resource):
    • folder - the folder name (where the service is located)
    • service - the service name
  • version - the semantic version, a.k.a. revision number (e.g., 0.4.2)

In Spark, a folder acts as a container that holds one or more services. Think of folders as a way to organize and group related services together. Each service represents an Excel model that has been converted into a Spark service. Services can exist in multiple versions, representing different iterations or updates of that service over time.

When interacting with a Spark service, you are always working with a specific version - the latest one by default. You can explicitly specify an older version if you need to work with a previous iteration of the service.

Hence, there are various ways to indicate a Spark service URI in the SDK:

  • {folder}/{service}[{version}] - version is optional.
  • service/{service_id}
  • version/{version_id}

Avoid URL-encoded characters in the service URI; the SDK takes care of URL encoding for you.
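
For instance, all of the following are valid string locators (the names and IDs below are placeholders):

uri = 'my-folder/my-service'         # latest version of the service
uri = 'my-folder/my-service[0.4.2]'  # a specific semantic version
uri = 'service/uuid-of-the-service'  # by service ID
uri = 'version/uuid-of-the-version'  # by version ID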

Here's an example of how to execute a Spark service:

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    response = spark.services.execute('my-folder/my-service', inputs={'value': 42})
    print(response.data)

Explore the examples and docs folders to find out more about the SDK's capabilities. For more AI-powered guidance and code documentation, check out the DeepWiki (powered by Devin.ai).

PRO TIP: A service URI locator can also be provided as a UriParams object rather than a string, combining several properties to locate a specific service (or version of it). For example, you may execute a public service by specifying the folder, service, and public properties.

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='open') as spark:
    uri = Spark.UriParams(folder='my-folder', service='my-service', public=True)
    response = spark.services.execute(uri, inputs={'value': 42})
    print(response.data)

See the Uri and UriParams classes for more details.

Client Options

As shown in the examples above, the Spark.Client is your entry point to the SDK. It is quite flexible and can be configured with the following options:

Base URL

base_url (default: os.getenv('CSPARK_BASE_URL')) indicates the base URL of Coherent Spark APIs. It should include the tenant and environment information.

spark = Spark.Client(base_url='https://excel.my-env.coherent.global/my-tenant', ...)

Alternatively, a combination of env and tenant options can be used to construct the base URL.

spark = Spark.Client(env='my-env', tenant='my-tenant', ...)

For more advanced customizations, you can extend the BaseUrl class and make the appropriate changes to support custom base URLs.

Authentication

The SDK supports three types of authentication schemes:

  • api_key (default: os.getenv('CSPARK_API_KEY')) indicates the API key (also known as synthetic key), which is sensitive and should be kept secure.
spark = Spark.Client(api_key='my-api-key', ...)

PRO TIP: The Spark platform supports public APIs that can be accessed without any form of authentication. In that case, you need to set api_key to 'open' in order to create a Spark.Client.

  • token (default: os.getenv('CSPARK_BEARER_TOKEN')) indicates the bearer token. It can be prefixed with 'Bearer' or not. A bearer token is usually valid for a limited time and should be refreshed periodically.
spark = Spark.Client(token='Bearer my-access-token', ...) # with prefix
# or
spark = Spark.Client(token='my-access-token', ...) # without prefix
  • oauth (default: os.getenv('CSPARK_CLIENT_ID') and os.getenv('CSPARK_CLIENT_SECRET'), or os.getenv('CSPARK_OAUTH_PATH')) indicates the OAuth2.0 client credentials. You can provide either the client ID and secret directly or the path to a JSON file containing the credentials.
spark = Spark.Client(oauth={'client_id': 'my-client-id', 'client_secret': 'my-client-secret'}, ...)
# or
spark = Spark.Client(oauth='path/to/oauth/credentials.json', ...)

Additional Settings

  • timeout (default: 60_000 ms) indicates the maximum amount of time that the client should wait for a response from Spark servers before timing out a request.

  • max_retries (default: 2) indicates the maximum number of times the client will retry a request after a temporary failure, such as an unauthorized response or any other status code greater than 400.

  • retry_interval (default: 1.0 second) indicates the delay between each retry.
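
For example, a client that waits longer and retries more patiently could be configured as follows (the values are illustrative):

spark = Spark.Client(
    env='my-env',
    tenant='my-tenant',
    api_key='my-api-key',
    timeout=90_000,      # 90 seconds
    max_retries=3,
    retry_interval=2.0,  # 2 seconds between attempts
)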

  • logger (default: True) enables or disables the logger for the SDK.

    • If bool, determines whether the SDK should print logs.
    • If dict, the SDK will print logs in accordance with the specified keyword arguments.
    • If LoggerOptions, the SDK will print messages based on the specified options:
      • context (default: CSPARK v{version}) defines the context of the logs (e.g., CSPARK v0.1.6);
      • disabled (default: False) determines whether the logger should be disabled;
      • colorful (default: True) determines whether the logs should be colorful;
      • timestamp (default: True) determines whether the logs should include timestamps;
      • datefmt (default: '%m/%d/%Y, %I:%M:%S %p') defines the date format for the logs;
      • level (default: DEBUG) defines the logging level for the logs.
spark = Spark.Client(logger=False, ...)
# or
spark = Spark.Client(logger={'colorful': False}, ...)
  • http_client (default: None) indicates a custom HTTP client to use for performing HTTP requests. It is an instance of httpx.Client (or httpx.AsyncClient) and can be used to configure proxy, cookies, timeout, SSL verification, etc.
import httpx
import cspark.sdk as Spark

spark = Spark.Client(
    base_url='https://spark.my-env.coherent.global/my-tenant',
    token='Bearer my-access-token',
    http_client=httpx.Client(proxy='https://my-proxy-url')
)

Client Errors

SparkError is the base class for all custom errors thrown by the SDK. It comes in two flavors:

  • SparkSdkError: usually thrown when an argument (user entry) fails to comply with the expected format. Because it's a client-side error, it will include the invalid entry as the cause in most cases.
  • SparkApiError: when attempting to communicate with the API, the SDK will wrap any sort of failure (any error during the roundtrip) into SparkApiError, which includes the HTTP status code of the response and the request_id, a unique identifier of the request.

Some of the derived SparkApiError classes are:

Type                     Status  When
BadRequestError          400     invalid request
UnauthorizedError        401     missing or invalid credentials
ForbiddenError           403     insufficient permissions
NotFoundError            404     resource not found
ConflictError            409     resource already exists
RateLimitError           429     too many requests
InternalServerError      500     server-side error
ServiceUnavailableError  503     server is down
UnknownApiError          None    unknown error
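
Here is a minimal sketch of how these errors might be handled; it assumes the error classes are importable from cspark.sdk like the rest of the API:

import cspark.sdk as Spark

try:
    with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
        response = spark.services.execute('my-folder/my-service', inputs={'value': 42})
        print(response.data)
except Spark.SparkApiError as err:
    print(err)  # carries the HTTP status code and request_id of the failed request
except Spark.SparkSdkError as err:
    print(err)  # usually caused by an invalid argument; includes the entry as the cause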

API Parity

The SDK aims to provide full parity with the Spark APIs over time. Below is a list of the currently supported APIs.

Authentication API - manages access tokens using OAuth2.0 Client Credentials flow:

  • Authorization.oauth.retrieve_token(config) generates new access tokens.
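
A minimal sketch, assuming the client was created with OAuth2.0 client credentials (the access path to the Authorization object below is an assumption):

import cspark.sdk as Spark

spark = Spark.Client(env='my-env', tenant='my-tenant', oauth='path/to/oauth/credentials.json')
spark.config.auth.oauth.retrieve_token(spark.config)  # assumed entry point for Authorization.oauth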

Folders API - manages folders:

  • Spark.folders.exists(name) checks if a folder exists.
  • Spark.folders.create(data) creates a new folder with additional info.
  • Spark.folders.find(name) finds folders by name, status, category, or favorite.
  • Spark.folders.update(id, data) updates a folder's information by id.
  • Spark.folders.delete(id) deletes a folder by id, including all its services.
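
For instance, a quick existence check followed by a search (a minimal sketch; the folder name is a placeholder):

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    if spark.folders.exists('my-folder'):
        response = spark.folders.find('my-folder')
        print(response.data)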

Services API - manages Spark services:

  • Spark.services.exists(uri) checks if a service exists.
  • Spark.services.create(data) creates a new Spark service.
  • Spark.services.execute(uri, inputs) executes a Spark service.
  • Spark.services.transform(uri, inputs) executes a Spark service using Transforms.
  • Spark.services.get_versions(uri) lists all the versions of a service.
  • Spark.services.get_swagger(uri) gets the Swagger documentation of a service.
  • Spark.services.get_schema(uri) gets the schema of a service.
  • Spark.services.get_metadata(uri) gets the metadata of a service.
  • Spark.services.search([params]) searches for services with pagination and filtering options.
  • Spark.services.download(uri) downloads the Excel file of a service.
  • Spark.services.recompile(uri) recompiles a service using specific compiler versions.
  • Spark.services.validate(uri, data) validates input data using static or dynamic validations.
  • Spark.services.delete(uri) deletes an existing service, including all its versions.
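
Beyond execute, the metadata-oriented methods follow the same pattern (a sketch reusing the URI forms described earlier):

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    versions = spark.services.get_versions('my-folder/my-service')
    schema = spark.services.get_schema('my-folder/my-service')
    print(versions.data, schema.data)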

Batches API - manages asynchronous batch processing:

  • Spark.batches.describe() describes the batch pipelines across a tenant.
  • Spark.batches.create(params, [options]) creates a new batch pipeline.
  • Spark.batches.of(id) defines a client-side batch pipeline by ID.
  • Spark.batches.of(id).get_info() gets the details of a batch pipeline.
  • Spark.batches.of(id).get_status() gets the status of a batch pipeline.
  • Spark.batches.of(id).push(data, [options]) adds input data to a batch pipeline.
  • Spark.batches.of(id).pull([options]) retrieves the output data from a batch pipeline.
  • Spark.batches.of(id).dispose() closes a batch pipeline.
  • Spark.batches.of(id).cancel() cancels a batch pipeline.
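
A rough end-to-end sketch of a batch pipeline; the create parameters, response shape, and input format below are assumptions, not the definitive API:

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    batch = spark.batches.create('my-folder/my-service')  # assuming a service URI as params
    pipeline = spark.batches.of(batch.data['id'])         # assuming the response exposes the pipeline ID
    pipeline.push(data={'inputs': [{'value': 42}]})       # assuming this input format
    print(pipeline.pull().data)
    pipeline.dispose()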

Log History API - manages service execution logs:

  • Spark.logs.get(call_id) retrieves the detailed log of a service execution.
  • Spark.logs.rehydrate(uri, call_id) rehydrates the executed model into the original Excel file.
  • Spark.logs.download(data) downloads service execution logs as a CSV or JSON file.
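
For example, fetching the log of a past execution (the call ID is a placeholder):

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    log = spark.logs.get('uuid-of-the-call')
    print(log.data)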

ImpEx API - imports and exports Spark services:

  • Spark.impex.exp(data) exports Spark entities (versions, services, or folders).
  • Spark.impex.imp(data) imports previously exported Spark entities into the platform.
  • Spark.impex.exports.cancel(job_id) cancels an in-progress export job.
  • Spark.impex.exports.describe() describes export jobs across the tenant.
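
As a sketch, exporting a service might look like this (the shape of the data argument is an assumption):

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    exported = spark.impex.exp(['my-folder/my-service'])  # assuming a list of service URIs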

Other APIs - for other functionality:

  • Spark.health.check() checks the health status of a Spark environment.
  • Spark.config.get() fetches the platform configuration for the current user.
  • Spark.wasm.download(uri) downloads a service's WebAssembly module.
  • Spark.files.download(url) downloads temporary files issued by the Spark platform.
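
For instance, a quick health probe (a minimal sketch):

import cspark.sdk as Spark

with Spark.Client(env='my-env', tenant='my-tenant', api_key='my-api-key') as spark:
    print(spark.health.check().data)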

Contributing

Feeling motivated enough to contribute? Great! Your help is always appreciated.

Please read CONTRIBUTING.md for details on the code of conduct, and the process for submitting pull requests.

Copyright and License

Apache-2.0
