W-Tools¶
This Python package provides a set of convenient tools we use in the W-Team / Data to Decisions research group at the Colorado School of Mines.
Getting Started¶
wtools
is installable using pip
. We haven’t yet deployed wtools
on PyPI as it is in its very early stages.
To get started using wtools
, clone this project:
$ git clone https://github.com/csmwteam/wtools.git
Then go into that cloned development directory and perform a local installation
via pip
in your active virtual environment:
$ cd wtools
$ pip install -e .
Then be sure to add the wtools
repository directory to your PYTHONPATH.
Contributing¶
Do you want to add features? Then go ahead and make commits to the project and push them to GitHub and create a Pull Request!
In your virtual environment, make sure you have all of the proper dependancies installed:
$ pip install -r requirements.txt
About W-Tools¶
- Author: Bane Sullivan
- License: BSD-3-Clause
- Copyright: 2018, Colorado School of Mines W-Team
- Version: 0.0.3
wtools: a Python package for W-Team research needs
File I/O¶
This module holds several methods for standard file I/O for the data formats that we work with regularly. Much of this regarding Grid objects in inherrited directly into the Grid class.
readGSLib¶
-
wtools.fileio.
readGSLib
(filename)[source]¶ This will read a standard GSLib or GeoEAS data file to a pandas
DataFrame
.Parameters: filename (str) – the string file name of the data to load. This can be a relative or abslute file path. Returns: A table containing the all data arrays. Note that an attribute called header
is added to the data frame contianing the string header line of the file.Return type: pandas.DataFrame
Mesh Tools¶
This provides a class for discretizing data in a convienant way that makes sense for our spatially referenced data/models.
Grid¶
-
class
wtools.mesh.
Grid
(**kwargs)[source]¶ Bases:
properties.base.base.HasProperties
,wtools.fileio.GridFileIO
- A data structure to store a model space discretization and different
attributes of that model space.
- Example:
>>> import wtools >>> import numpy as np >>> models = { 'rand': np.random.random(1000).reshape((10,10,10)), 'spatial': np.arange(1000).reshape((10,10,10)), } >>> grid = wtools.Grid(models=models) >>> grid.validate() # Make sure the data object was created successfully True
- Note:
- See Jupyter notebooks under the
examples
directory
Required Properties:
- origin (
Vector3
): The lower southwest corner of the data volume., a 3D Vector of <type ‘float’> with shape (3), Default: [0.0, 0.0, 0.0] - xtensor (
Array
): Tensor cell widths, x-direction, a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*) - ytensor (
Array
): Tensor cell widths, y-direction, a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*) - ztensor (
Array
): Tensor cell widths, z-direction, a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*)
Optional Properties:
- models (
Dictionary
): The volumetric data as a 3D NumPy arrays in <X,Y,Z> or <i,j,k> coordinates. Each key value pair represents a different model for the gridded model space. Keys will be treated as the string name of the model., a dictionary (keys: a unicode string; values: a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*, *, *))
-
bounds
¶ The bounds of the grid
-
classmethod
deserialize
(value, trusted=False, strict=False, assert_valid=False, **kwargs)¶ Creates HasProperties instance from serialized dictionary
This uses the Property deserializers to deserialize all JSON-compatible dictionary values into their corresponding Property values on a new instance of a HasProperties class. Extra keys in the dictionary that do not correspond to Properties will be ignored.
Parameters:
- value - Dictionary to deserialize new instance from.
- trusted - If True (and if the input dictionary has
'__class__'
keyword and this class is in the registry), the new HasProperties class will come from the dictionary. If False (the default), only the HasProperties class this method is called on will be constructed. - strict - Requires
'__class__'
, if present on the input dictionary, to match the deserialized instance’s class. Also disallows unused properties in the input dictionary. Default is False. - assert_valid - Require deserialized instance to be valid. Default is False.
- Any other keyword arguments will be passed through to the Property deserializers.
-
display
(plt, key, plane='xy', slc=None, showit=True, **kwargs)[source]¶ Display a 2D slice of this grid.
Parameters:
-
keys
¶ List of the string names for each of the models
-
models
¶ a unicode string; values: a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*, *, *))
Type: models ( Dictionary
)Type: The volumetric data as a 3D NumPy arrays in <X,Y,Z> or <i,j,k> coordinates. Each key value pair represents a different model for the gridded model space. Keys will be treated as the string name of the model., a dictionary (keys
-
num_cells
¶ Number of cells
-
num_nodes
¶ Number of nodes (vertices)
-
nx
¶ Number of cells in the X direction
-
ny
¶ Number of cells in the Y direction
-
nz
¶ Number of cells in the Z direction
-
origin
¶ [0.0, 0.0, 0.0]
Type: origin ( Vector3
)Type: The lower southwest corner of the data volume., a 3D Vector of <type ‘float’> with shape (3), Default
-
classmethod
readSGeMSGrid
(fname, origin=[0.0, 0.0, 0.0], spacing=[1.0, 1.0, 1.0])¶ Reads an SGeMS grid file where grid shape is defined in the header as three integers seperated by whitespace. Data arrays are treated as 3D and given in <x, y, z> indexing to a
Grid
object.Parameters: Returns: The SGeMS data loaded onto a
Grid
object.Return type:
-
classmethod
readUBC
(name, directory='')¶
-
saveSGeMS
(filename)¶ This will save the grid in the SGeMS gridded data file format
-
saveUBC
(fname)¶ Save the grid in the UBC mesh format.
-
serialize
(include_class=True, save_dynamic=False, **kwargs)¶ Serializes a HasProperties instance to dictionary
This uses the Property serializers to serialize all Property values to a JSON-compatible dictionary. Properties that are undefined are not included. If the HasProperties instance contains a reference to itself, a
properties.SelfReferenceError
will be raised.Parameters:
- include_class - If True (the default), the name of the class
will also be saved to the serialized dictionary under key
'__class__'
- save_dynamic - If True, dynamic properties are written to the serialized dict (default: False).
- Any other keyword arguments will be passed through to the Property serializers.
- include_class - If True (the default), the name of the class
will also be saved to the serialized dictionary under key
-
shape
¶ 3D shape of the grid (number of cells in all three directions)
-
classmethod
tableToGrid
(df, shp, origin=[0.0, 0.0, 0.0], spacing=[1.0, 1.0, 1.0], order='F')¶ Converts a pandas
DataFrame
table to aGrid
object.Parameters: Returns: The data table loaded onto a
Grid
object.Return type:
-
toDataFrame
(order='C')[source]¶ Returns the models in this Grid to a Pandas DataFrame with all arrays flattened in the specified order. A header attribute is added to the DataFrame to specified the grid extents. Much metadata is lost in this conversion.
-
xcenters
¶ The cell center coordinates along the X-axis
-
xnodes
¶ The node coordinates along the X-axis
-
xtensor
¶ Tensor cell widths, x-direction, a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*)
Type: xtensor ( Array
)
-
ycenters
¶ The cell center coordinates along the Y-axis
-
ynodes
¶ The node coordinates along the Y-axis
-
ytensor
¶ Tensor cell widths, y-direction, a list or numpy array of <type ‘float’>, <type ‘int’> with shape (*)
Type: ytensor ( Array
)
-
zcenters
¶ The cell center coordinates along the Z-axis
-
znodes
¶ The node coordinates along the Z-axis
Plotting Routines¶
plots
: This module provides various plotting routines that ensure we display
our spatially referenced data in logical, consistant ways across projects.
display¶
-
wtools.plots.
display
(plt, arr, x=None, y=None, **kwargs)[source]¶ This provides a convienant class for plotting 2D arrays that avoids treating our data like images. Since most datasets we work with are defined on Cartesian coordinates, <i,j,k> == <x,y,z>, we need to transpose our arrays before plotting in image plotting libraries like
matplotlib
.Parameters: - plt (handle) – your active plotting handle
- arr (np.ndarray) – A 2D array to plot
- kwargs (dict) – Any kwargs to pass to the
pcolormesh
plotting routine
Returns: plt.pcolormesh
Example
>>> import numpy as np >>> import matplotlib.pyplot as plt >>> arr = np.arange(1000).reshape((10,100)) >>> wtools.display(plt, arr) >>> plt.title('What we actually want') >>> plt.colorbar() >>> plt.show()
plotStructGrid¶
-
wtools.plots.
plotStructGrid
(plt, outStruct, gridspecs=None, imeas=None)[source]¶ Plot a semivariogram or covariogram produced from raster2structgrid
Parameters: - plt (handle) – An active plotting handle. This allows us to use the plotted result after the routine.
- outStruct (np.ndarray) – the data to plot
- gridspecs (list(GridSpec)) – the spatial reference of your gdata
- imeas (str) – key indicating which structural measure to label:
'var'
for semi-variogram or'covar'
for covariogram. This simply adds a few labels to the active figure. If semi-variance useTrue
. If covariance, useFalse
.
Returns: plt.plot or plt.pcolor
Array Transforms¶
transform
: This module provides several conveinance methods for
transforming NumPy arrays in a Cartesian coordinate system.
emptyArray¶
meshgrid¶
-
wtools.transform.
meshgrid
(x, y, z=None)[source]¶ Use this convienance method for your meshgrid needs. This ensures that we always use <ij> indexing to stay consistant with Cartesian grids.
This simply provides a wrapper for
np.meshgrid
ensuring we always useindexing='ij'
which makes sense for typical Cartesian coordinate systems (<x,y,z>).Note
This method handles 2D or 3D grids.
Example
>>> import wtools >>> import numpy as np >>> x = np.arange(20, 200, 10) >>> y = np.arange(20, 500, 20) >>> z = np.arange(0, 1000, 50) >>> xx, yy, zz = wtools.meshgrid(x, y, z) >>> # Now check that axii are ordered correctly >>> assert(xx.shape[0] == len(x)) >>> assert(xx.shape[1] == len(y)) >>> assert(xx.shape[2] == len(z))
transpose¶
-
wtools.transform.
transpose
(arr)[source]¶ Transpose matrix from Cartesian to Earth Science coordinate system. This is useful for UBC Meshgrids where +Z is down.
Note
Works forward and backward.
Parameters: arr (ndarray) – 3D NumPy array to transpose with ordering: <i,j,k> Returns: same array transposed from <i,j,k> to <j,i,-k> Return type: ndarray Example
>>> import wtools >>> import numpy as np >>> model = np.random.random(1000).reshape((10, 20, 5)) >>> wtools.transpose(model).shape (20, 10, 5)
Geostatistics¶
Grids¶
GridSpec¶
-
class
wtools.geostats.grids.
GridSpec
(**kwargs)[source]¶ Bases:
properties.base.base.HasProperties
- A
GridSpec
object provides the details of a single axis along a grid. - If you have a 3D grid then you will have 3
GridSpec
objects.
Required Properties:
- min (
Integer
): The minimum value along this dimension. The origin., an integer, Default: 0 - n (
Integer
): The number of components along this dimension., an integer - sz (
Integer
): The uniform cell size along this dimension., an integer, Default: 1
Optional Properties:
- nnodes (
Integer
): The number of grid nodes to consider on either side of the origin in the output map, an integer
-
classmethod
deserialize
(value, trusted=False, strict=False, assert_valid=False, **kwargs)¶ Creates HasProperties instance from serialized dictionary
This uses the Property deserializers to deserialize all JSON-compatible dictionary values into their corresponding Property values on a new instance of a HasProperties class. Extra keys in the dictionary that do not correspond to Properties will be ignored.
Parameters:
- value - Dictionary to deserialize new instance from.
- trusted - If True (and if the input dictionary has
'__class__'
keyword and this class is in the registry), the new HasProperties class will come from the dictionary. If False (the default), only the HasProperties class this method is called on will be constructed. - strict - Requires
'__class__'
, if present on the input dictionary, to match the deserialized instance’s class. Also disallows unused properties in the input dictionary. Default is False. - assert_valid - Require deserialized instance to be valid. Default is False.
- Any other keyword arguments will be passed through to the Property deserializers.
-
equal
(other)¶ Determine if two HasProperties instances are equivalent
Equivalence is determined by checking if all Property values on two instances are equal, using
Property.equal
.
-
min
¶ 0
Type: min ( Integer
)Type: The minimum value along this dimension. The origin., an integer, Default
-
nnodes
¶ The number of grid nodes to consider on either side of the origin in the output map, an integer
Type: nnodes ( Integer
)
-
serialize
(include_class=True, save_dynamic=False, **kwargs)¶ Serializes a HasProperties instance to dictionary
This uses the Property serializers to serialize all Property values to a JSON-compatible dictionary. Properties that are undefined are not included. If the HasProperties instance contains a reference to itself, a
properties.SelfReferenceError
will be raised.Parameters:
- include_class - If True (the default), the name of the class
will also be saved to the serialized dictionary under key
'__class__'
- save_dynamic - If True, dynamic properties are written to the serialized dict (default: False).
- Any other keyword arguments will be passed through to the Property serializers.
- include_class - If True (the default), the name of the class
will also be saved to the serialized dictionary under key
-
validate
()¶ Call all registered class validator methods
These are all methods decorated with
@properties.validator
. Validator methods are expected to raise a ValidationError if they fail.
- A
geoeas2numpy¶
-
wtools.geostats.grids.
geoeas2numpy
(datain, nx, ny=None, nz=None)[source]¶ Transform GeoEas array into np.ndarray to be treated like image. Function to transform a SINGLE GoeEas-formatted raster (datain) i.e., a single column, to a NumPy array that can be viewed using imshow (in 2D) or slice (in 3D).
Parameters: Returns: - If only nx given: 1D array.
If only nx and ny given: 2D array. If nx, ny, and nz given: 3D array.
Return type: np.ndarray
Note
In 3D, z increases upwards
References
- Originally implemented in MATLAB by:
- Phaedon Kyriakidis, Department of Geography, University of California Santa Barbara, May 2005
- Reimplemented into Python by:
- Bane Sullivan and Jonah Bartrand, Department of Geophysics, Colorado School of Mines, October 2018
Rasters¶
This module provides useful methods for operating on 1D and 2D rasters such as making variogram or covariograms.
raster2structgrid¶
-
wtools.geostats.raster.
raster2structgrid
(datain, imeas='covar', rtol=1e-10)[source]¶ Create an auto-variogram or auto-covariance map from 1D or 2D rasters. This computes auto-variogram or auto-covariance maps from 1D or 2D rasters. This function computes variograms/covariances in the frequency domain via the Fast Fourier Transform (
np.fftn
).Note
For viewing the results, please use the
plotStructGrid
method from theplots
module.Note
Missing values, flagged as
np.nan
, are allowed.Parameters: - datain (np.ndarray) – input arrray with raster in GeoEas format
- imeas (str) – key indicating which structural measure to compute:
'var'
for semi-variogram or'covar'
for covariogram. - gridspecs (list(GridSpec)) – array with grid specifications using
GridSpec
objects - rtol (float) – the tolerance. Default is 1e-10
Returns: output array with variogram or covariogram map, depending on variogram choice, with size: in 1D: ( 2*nxOutHalf+1 ) or in 2D: ( 2*nxOutHalf+1 x 2*nxOutHalf+1 ).
output array with number of pairs available in each lag, of same size as outStruct
Return type: tuple(np.ndarray, np.ndarray)
References
- Originally implemented in MATLAB by:
- Phaedon Kyriakidis, Department of Geography, University of California Santa Barbara, May 2005
- Reimplemented into Python by:
- Jonah Bartrand, Department of Geophysics, Colorado School of Mines, October 2018
- Algorith based on:
- Marcotte, D. (1996): Fast Variogram Computation with FFT, Computers & Geosciences, 22(10), 1175-1186.
suprts2modelcovFFT¶
-
wtools.geostats.raster.
suprts2modelcovFFT
(CovMapExtFFT, ind1Ext, sf1Ext, ind2Ext, sf2Ext)[source]¶ Integrated model covariances between 1 or 2 sets of arbitrary supports. Function to calculate array of TOTAL or AVERAGE model covariances between 1 or 2 sets of irregular supports, using convolution in the frequency domain (FFT-based). Integration or averaging is IMPLICIT in the pre-computed sampling functions (from discrsuprtsFFT).
Parameters: - CovMapExtFFT (np.ndarray) – Fourier transform of model covariance map evaluated at nodes of an extended MATLAB grid
- ind1Ext – (nSup1 x 1) cell array with MATLAB indices of non-zero sampling function values for support set #1 in extended MATLAB grid
- sf1Ext – (nSup1 x 1) cell array with sampling function values for support set #1
- ind2Ext – Optional (nSup2 x 1) cell array with MATLAB indices of non-zero sampling function values for support set #2 in extended MATLAB grid
- sf2Ext – Optional (nSup2 x 1) cell array with sampling function values for support set #2
Returns: (nSup1 x nSup[1,2]) array with integrated covariances
Return type: np.ndarray
References
- Originally implemented in MATLAB by:
- Phaedon Kyriakidis, Department of Geography, University of California Santa Barbara, May 2005
- Reimplemented into Python by:
- Bane Sullivan and Jonah Bartrand, Department of Geophysics, Colorado School of Mines, October 2018
Docs Automatically Generated
Learn more about automatic documentation