AllegroGraph Python API Reference¶
This is a description of the Python Application Programmer’s Interface (API) to Franz Inc.’s AllegroGraph.
The Python API offers convenient and efficient access to an AllegroGraph server from a Python-based application. This API provides methods for creating, querying and maintaining RDF data, and for managing the stored triples.
Note
The Python API deliberately emulates the Eclipse RDF4J API (formerly Aduna Sesame) to make it easier to migrate from RDF4J to AllegroGraph. The Python API has also been extended in ways that make it easier and more intuitive than the RDF4J API.
AllegroGraphServer class¶
-
class
franz.openrdf.sail.allegrographserver.
AllegroGraphServer
(host=None, port=None, user=None, password=None, cainfo=None, sslcert=None, verifyhost=None, verifypeer=None, protocol=None, proxy=None, **options) The AllegroGraphServer object represents a remote AllegroGraph server on the network. It is used to inventory and access the catalogs of that server.
Example:
server = AllegroGraphServer(host="localhost", port=8080, user="test", password="pw")
__init__
Define the connection to the AllegroGraph HTTP server. addRole
Creates a new role. addRoleAccess
Grant read/write access to a role. addRolePermission
Grants a role a certain permission. addRoleSecurityFilter
Add a security filter for the user. addScript
Create or replace a sitescript. addUser
Create a new user. addUserAccess
Grant read/write access to a user. addUserPermission
Assigns the given permission to this user. addUserRole
Add a role to a user. addUserSecurityFilter
Add a security filter for the user. changeUserPassword
Change the password for the given user. deleteRole
Deletes a role. deleteRoleAccess
Revoke read/write access for a role. deleteRolePermission
Revokes a permission for a role. deleteRoleSecurityFilter
Add a security filter for the user. deleteScript
Delete a sitescript. deleteUser
Delete a user. deleteUserAccess
Takes the same parameters as PUT on this URL, but revokes the access instead of granting it. deleteUserData
Remove user data from the server. deleteUserPermission
Revokes the given permission for this user. deleteUserRole
Remove a role from a user. deleteUserSecurityFilter
Add a security filter for the user. getInitfile
Retrieve the contents of the server initialization file. getScript
Get the body of a sitescript. getUserData
Retrieve user data value with given key. listCatalogs
Get the list of catalogs on this server. listRoleAccess
Query the access granted to a role. listRolePermissions
Lists the permission flags granted to a role. listRoleSecurityFilters
List security filters for user. listRoles
Returns the names of all defined roles. listScripts
Return the list of Sitescripts currently on the server. listUserAccess
Retrieve the read/write access for a user. listUserData
Get all used keys from the user data store on the server. listUserEffectiveAccess
Like listUserAccess, but also includes the access granted to roles that this user has. listUserEffectivePermissions
Retrieve the permission flags assigned to the user, or any of its roles. listUserPermissions
List the permission flags that have been assigned to a user (any of super, eval, session, replication). listUserRoles
Retrieves a list of role names for this user name. listUserSecurityFilters
List security filters for user. listUsers
Returns a list of names of all the users that have been defined. openCatalog
Open a catalog by name. openFederated
Open a session that federates several repositories. openSession
Open a session on a federated, reasoning, or filtered store. setInitfile
Replace the current initialization file contents with the content
string or remove if None.setUserData
Set user data with given key.
Catalog class¶
-
class
franz.openrdf.sail.allegrographserver.
Catalog
(name, server, client) Container of multiple repositories (triple stores).
Construct catalogs using the server object:
catalog = server.openCatalog('scratch')
createRepository
Creates a new repository with the given name. deleteRepository
Delete a repository. getName
Return the catalog name. getRepository
Creates or opens a repository. listRepositories
Return a list of names of repositories (triple stores) managed by this catalog.
Spec module¶
Helper functions for creating session specification strings.
See openSession()
federate |
Create a session spec for connecting to a federated store. |
graphFilter |
Create a graph-filtered session spec. |
local |
Create a session spec for connecting to a store on the local server. |
map |
Return a list of the results of applying the function to the items of the argument sequence(s). |
reason |
Create a session spec that adds reasoning support to another session. |
remote |
Create a session spec for connecting to a store on another server. |
url |
Create a session spec for connecting to a remote store with known URL. |
Repository class¶
-
class
franz.openrdf.repository.repository.
Repository
(catalog, database_name, repository) A repository contains RDF data that can be queried and updated. Access to the repository can be acquired by opening a connection to it. This connection can then be used to query and/or update the contents of the repository.
Please note that a repository needs to be initialized before it can be used and that it should be shut down before it is discarded/garbage collected. Forgetting the latter can result in loss of data (depending on the Repository implementation)!
Construct instances using
getRepository()
.with catalog.getRepository("agraph_test", Repository.ACCESS) as repo: ...
__init__
Invoke through getRepository()
.getConnection
Opens a connection to this store that can be used for querying and updating the contents of the store. getDatabaseName
Return the name of the database (remote triple store) that this repository is interfacing with. getSpec
Return a session spec string for this repository. getValueFactory
Return a ValueFactory for this store. initialize
Initializes this repository. isWritable
Checks whether this store is writable, i.e. registerDatatypeMapping
Register an inlined datatype. shutDown
Shuts the store down, releasing any resources that it keeps hold of.
Utility connection functions¶
Manually constructing server, catalog and repository objects is often
tedious when only a simple connection to a single repository is needed.
In that case the connection may be created directly using
ag_connect()
.
ag_connect |
Create a connection to an AllegroGraph repository. |
RepositoryConnection class¶
-
class
franz.openrdf.repository.repositoryconnection.
RepositoryConnection
(repository, close_repo=False, is_session=False) The RepositoryConnection class is the main interface for updating data in and performing queries on a
Repository
. By default, a RespositoryConnection is in autoCommit mode, meaning that each operation corresponds to a single transaction on the underlying triple store. autoCommit can be switched off, in which case it is up to the user to handle transaction commit/rollback. Note that care should be taken to always properly close a RepositoryConnection after one is finished with it, to free up resources and avoid unnecessary locks.Note that concurrent access to the same connection object is explicitly forbidden. The client must perform its own synchronization to ensure non-concurrent access.
Several methods take a vararg argument that optionally specifies a set of contexts on which the method should operate. (A context is the URI of a subgraph.) Note that a vararg parameter is optional, it can be completely left out of the method call, in which case a method either operates on a provided statement’s context (if one of the method parameters is a statement or collection of statements), or operates on the repository as a whole, completely ignoring context. A vararg argument may also be
None
, meaning that the method operates on those statements which have no associated context only.RepositoryConnection
objects should be constructed usinggetConnection()
. To ensure that repository connections are closed, the best practice is to use awith
statement:with repository.getConnection() as conn: ...
__init__
Call through getConnection()
.
General connection methods¶
This section contains the RepositoryConnection
methods that create,
maintain, search, and delete triple stores.
add |
Calls addTriple() , addStatement() , or addFile() . |
addData |
Adds data from a string to the repository. |
addFile |
Loads a file into the triple store. |
addStatement |
Add the supplied statement to the specified contexts in the repository. |
addTriple |
Add a single triple to the repository. |
addTriples |
Add the supplied triples or quads to this repository. |
clear |
Removes all statements from designated contexts in the repository. |
clearNamespaces |
Delete all namespaces in this repository for the current user. |
close |
Close the connection. |
createBNode |
Create a new blank node. |
createLiteral |
Create a new literal with value value . |
createRange |
Create a compound literal representing a range from lowerBound to upperBound. |
createStatement |
Create a new Statement object. |
createURI |
Creates a new URI from the supplied string-representation(s). |
deleteDuplicates |
Delete duplicate triples from the store. |
executeBooleanQuery |
Prepare and immediately evaluate a query that returns a boolean. |
executeGraphQuery |
Prepare and immediately evaluate a query that returns RDF. |
executeTupleQuery |
Prepare and immediately evaluate a query that returns tuples. |
executeUpdate |
Prepare and immediately evaluate a SPARQL update query. |
export |
Export all explicit statements in the specified contexts to a file. |
exportStatements |
Export statements to a file. |
getAddCommitSize |
Get the current value of add_commit_size . |
getContextIDs |
Return a list of context resources, one for each context referenced by a quad in the triple store. |
getDuplicateStatements |
Return all duplicates in the store. |
getNamespace |
Get the namespace that is associated with the specified prefix, if any. |
getNamespaces |
Get all declared prefix/namespace pairs. |
getSpec |
Get the session specification string for this repository. |
getStatements |
Get all statements with a specific subject, predicate and/or object from the repository. |
getStatementsById |
Return all statements whose triple ID matches an ID in the list ‘ids’. |
getValueFactory |
Get the ValueFactory associated with this repository. |
isEmpty |
Return True if this repository does not contain any (explicit) statements. |
namespace |
Creates an object that allows for simple creation of URIs in given namespace. |
prepareBooleanQuery |
Parse query into a boolean query object which can be executed against the triple store. |
prepareGraphQuery |
Parse query into a graph query object which can be executed against the triple store. |
prepareTupleQuery |
Parse query into a tuple query object which can be executed against the triple stroe. |
prepareUpdate |
Parse query into an update query object which can be executed against the triple store. |
registerDatatypeMapping |
Register an inlined datatype. |
remove |
Call removeTriples() or removeStatement() . |
removeNamespace |
Remove a namespace declaration by removing the association between a prefix and a namespace name. |
removeQuads |
Remove enumerated quads from this repository. |
removeQuadsByID |
Remove all quads with matching IDs. |
removeStatement |
Remove the supplied statement from the specified contexts in the repository. |
removeTriples |
Remove the statement(s) with the specified subject, predicate and object from the repository, optionally restricted to the specified contexts. |
setAddCommitSize |
Set the value of add_commit_size . |
setNamespace |
Define or redefine a namespace mapping in the repository. |
size |
Returns the number of (explicit) statements that are in the specified contexts in this repository. |
warmup |
Warm up specified parts of the repository in order to speed up access. |
add_commit_size |
The threshold for commit size during triple add operations. |
Triple Index Methods¶
These RepositoryConnection
methods support user-defined triple indices.
See AllegroGraph Triple Indices for more
information on this topic.
listIndices |
Return the list of the current set of triple indices. |
listValidIndices |
Return the list of valid index names. |
addIndex |
Add a specific type of index to the current set of triple indices. |
dropIndex |
Removes a specific type of index to the current set of triple indices. |
optimizeIndices |
Optimize indices. |
Free Text Indexing Methods¶
The following RepositoryConnection
methods support free-text indexing in
AllegroGraph.
createFreeTextIndex |
Create a free-text index with the given parameters. |
deleteFreeTextIndex |
Delete a free-text index from the server. |
evalFreeTextSearch |
Return a list of statements for the given free-text pattern search. |
getFreeTextIndexConfiguration |
Get the current settings of a free-text index. |
listFreeTextIndices |
Get the names of currently defined free-text indices. |
modifyFreeTextIndex |
Modify parameters of a free-text index. |
Note that text search is implemented through a SPARQL query using a
“magic” predicate called fti:search
. See the AllegroGraph
Python API Tutorial for an example of how to set up this
search.
Prolog Rule Inference Methods¶
These RepositoryConnection
methods support the use of Prolog rules in
AllegroGraph. Any use of Prolog rules requires that you create a
transaction to run them in.
addRules |
Add Prolog functors to the current session. |
loadRules |
Add Prolog rules from file to the current session. |
Geospatial Reasoning Methods¶
These RepositoryConnection
methods support geospatial reasoning.
createBox |
Create a rectangular search region (a box) for geospatial search. |
createCircle |
Create a circular search region for geospatial search. |
createCoordinate |
Create an x, y or latitude, longitude coordinate in the current coordinate system. |
createLatLongSystem |
Create a spherical coordinate system and use it as the current coordinate system. |
createPolygon |
Define a polygonal region with the specified vertices. |
createRectangularSystem |
Create a Cartesian coordinate system and use it as the current coordinate system. |
getGeoType |
Get the current geospatial coordinate system. |
setGeoType |
Set the current geospatial coordinate system. |
Transactions¶
AllegroGraph lets you set up a special RepositoryConnection
(a
“session”) that supports transaction semantics. You can add statements
to this session until you accumulate all the triples you need for a
specific transaction. Then you can commit the triples in a single act.
Up to that moment the triples will not be visible to other users of the
repository.
If anything interrupts the accumulation of triples building to the transaction, you can roll back the session. This discards all of the uncommitted triples and resynchronizes the session with the repository as a whole.
Closing the session discards all uncommitted triples and all rules, generators, and matrices that were created in the session. Rules, generators, and matrices cannot be committed. They persist as long as the session persists.
openSession |
Open a session. |
closeSession |
Close a session. |
session |
A session context manager for use with the with statement: |
commit |
Commit changes on an open session. |
rollback |
Roll back changes on open session. |
Subject Triples Caching¶
You can enable subject triple caching to speed up queries where the same subject URI appears in multiple patterns. The first time AllegroGraph retrieves triples for a specific resource, it caches the triples in memory. Subsequent query patterns that ask for the same subject URI can retrieve the matching triples very quickly from the cache. The cache has a size limit and automatically discards old entries as that limit is exceeded.
enableSubjectTriplesCache |
Maintain a cache of size size that caches, for each accessed resource, quads where the resource appears in subject position. |
disableSubjectTriplesCache |
Disable the subject triples cache (see enableSubjectTriplesCache() ). |
getSubjectTriplesCacheSize |
Return the current size of the subject triples cache. |
Query Class (and Subclasses)¶
Note
The Query class is non-instantiable. It is an abstract class from which the three query subclasses are derived. It is included here because of its methods, which are inherited by the subclasses.
A query on a Repository
that can be formulated
in one of the supported query languages (for example SPARQL). The query can
be parameterized, to allow one to reuse the same query with different parameter
bindings.
The best practice is to allow the RepositoryConnection
object to
create an instance of one of the Query subclasses
(TupleQuery
, GraphQuery
, BooleanQuery
, UpdateQuery
).
There is no reason for the Python application programmer to create
such objects directly.
tupleQuery = conn.prepareTupleQuery(QueryLanguage.SPARQL, queryString)
result = tupleQuery.evaluate()
evaluate_generic_query |
Evaluate a SPARQL or PROLOG query, which may be a ‘select’, ‘construct’, ‘describe’ or ‘ask’ query (in the SPARQL case). |
getBindings |
Retrieve the bindings that have been set on this query. |
getDataset |
Get the dataset against which this query will operate. |
getIncludeInferred |
Check whether this query will return inferred statements. |
removeBinding |
Removes a previously set binding on the supplied variable. |
setBinding |
Binds the specified variable to the supplied value. |
setBindings |
Set multiple variable bindings. |
setCheckVariables |
Determine whether the presence of variables in the select clause not referenced in a triple is flagged. |
setConnection |
Internal call to embed the connection into the query. |
setContexts |
Assert a set of contexts (named graphs) that filter all triples. |
setDataset |
Select the dataset against which to evaluate this query, overriding any dataset that is specified in the query itself. |
setIncludeInferred |
Determine whether evaluation results of this query should include inferred statements. |
Subclass TupleQuery¶
This subclass is used with SELECT queries. Use the
prepareTupleQuery()
method to create
a TupleQuery object. The results of the query are returned in a
QueryResult
iterator that yields a sequence of
binding sets
.
TupleQuery uses all the methods of the Query
class,
plus two more:
evaluate |
Execute the embedded query against the RDF store. |
analyze |
Analyze the query. |
Subclass GraphQuery¶
This subclass is used with CONSTRUCT and DESCRIBE queries. Use the
prepareGraphQuery()
method to create a
GraphQuery object. The results of the query are returned in a
GraphQueryResult
iterator that yields a
sequence of statements
.
GraphQuery
implements all the methods of the Query
class,
plus one more:
evaluate |
Execute the embedded query against the RDF store. |
Subclass BooleanQuery¶
This subclass is used with ASK queries. Use the
prepareBooleanQuery()
method to create
a BooleanQuery
object. The results of the query are
True
or False
.
BooleanQuery implements all the methods of the Query
class, plus one more:
evaluate |
Execute the embedded query against the RDF store. |
Subclass UpdateQuery¶
This subclass is used for DELETE
and INSERT
queries. The
result returned when the query is evaluated is a boolean that can be
used to tell if the store has been modified by the operation. Use the
prepareUpdate()
method to create an
UpdateQuery
object.
UpdateQuery
implements all the methods of the Query
class, plus one more:
evaluate |
Execute the embedded update against the RDF store. |
QueryResult Class¶
A QueryResult object is simply an iterator that also has a
close()
method that must be called to free
resources. Such objects are returned as a result of SPARQL and PROLOG
query evaluation and should not be constructed directly. Result
objects are context managers and can be used in the with
statement. The recommended usage looks like this:
tupleQuery = conn.prepareTupleQuery(QueryLanguage.SPARQL, queryString)
with tupleQuery.evaluate() as results:
for result in results:
print result
It is also possible to use one of the execute*Query
methods
(e.g. executeTupleQuery()
) to prepare and execute a query in a
single call:
with conn.executeTupleQuery(queryString) as results:
for result in results:
print result
close |
Release resources used by this query result. |
__next__ |
Return the next result item if there is one. |
Subclass TupleQueryResult¶
A QueryResult subclass used for queries that return tuples.
getBindingNames |
Get the names of the bindings, in order of projection. |
getMetadata |
Get a nested dictionary containing query result metadata. |
Subclass GraphQueryResult¶
A QueryResult subclass used for queries that return statements. Objects
of this class are also RepositoryResult
instances.
RepositoryResult class¶
A RepositoryResult
object is a result collection of statements
that can be iterated over. It keeps
an open connection to the backend for lazy retrieval of individual
results. Additionally it has some utility methods to fetch all results
and add them to a collection.
By default, a RepositoryResult
is not necessarily a (mathematical)
set: it may contain duplicate objects. Duplicate filtering can be
switched on, but this should not be used lightly as the filtering
mechanism is potentially memory-intensive.
A RepositoryResult
must be closed after use to free up any resources
(open connections, read locks, etc.) it has on the underlying
repository. To make this easier it is a context manager and
can be used in the with
statement.
graphQuery = conn.prepareGraphQuery(QueryLanguage.SPARQL, queryString)
with graphQuery.evaluate() as results:
for result in results:
print result
close |
Shut down the iterator to be sure the resources are freed up. |
__next__ |
Return the next Statement in the answer, if there is one. |
enableDuplicateFilter |
Switch on duplicate filtering while iterating over objects. |
asList |
Returns a list containing all objects of this RepositoryResult in order of iteration. |
addTo |
Add all objects of this RepositoryResult to the supplied collection. |
rowCount |
Get the number of statements in this result object. |
Statement Class¶
A Statement
is a client-side triple. It encapsulates the subject,
predicate, object and context (subgraph) values of a single triple and
makes them available.
Best practice is to allow the RepositoryConnection.createStatement()
method to create and return the Statement object. There is no reason for
the Python application programmer to create a Statement object
directly.
stmt1 = conn.createStatement(alice, age, fortyTwo)
getContext |
Get the graph (the fourth, optional element of the statement). |
getObject |
Get the object (the third element of the statement). |
getPredicate |
Get the predicate (the second element of the statement). |
getSubject |
Get the subject (the first element of the statement). |
ValueFactory Class¶
A ValueFactory is a factory for creating URIs, blank nodes, literals
and Statement
objects. In the AllegroGraph Python interface,
the ValueFactory
class is regarded as obsolete. Its
functions have been subsumed by the expanded capability of the
RepositoryConnection
class. It is documented here for the
convenience of users porting an application from Eclipse RDF4J.
Social Network Analysis Methods¶
The following
RepositoryConnection
methods support Social Network Analysis in AllegroGraph. The Python API to the Social Network Analysis methods of AllegroGraph requires Prolog queries, and therefore must be run in a dedicated session.registerNeighborMatrix
name
.registerSNAGenerator
name
.