LSTM recurrent layer. More...

#include "all_layers.hpp"

Inheritance diagram for cv::dnn::LSTMLayer:

Public Member Functions
int	inputNameToIndex (String inputName)
	Returns index of input blob into the input array. More...

int	outputNameToIndex (String outputName)
	Returns index of output blob in output array. More...

virtual void	setOutShape (const MatShape &outTailShape=MatShape())=0
	Specifies shape of output blob which will be [[`T`], `N`] + `outTailShape`. More...

virtual void	setProduceCellOutput (bool produce=false)=0
	If this flag is set to true then layer will produce \( c_t \) as second output. More...

virtual void	setUseTimstampsDim (bool use=true)=0
	Specifies either interpet first dimension of input blob as timestamp dimenion either as sample. More...

virtual void	setWeights (const Mat &Wh, const Mat &Wx, const Mat &b)=0

Public Member Functions inherited from cv::dnn::Layer
	Layer ()

	Layer (const LayerParams &params)
	Initializes only name, type and blobs fields. More...

virtual	~Layer ()

virtual void	applyHalideScheduler (Ptr< BackendNode > &node, const std::vector< Mat *> &inputs, const std::vector< Mat > &outputs, int targetId) const
	Automatic Halide scheduling based on layer hyper-parameters. More...

virtual void	finalize (const std::vector< Mat *> &input, std::vector< Mat > &output)
	Computes and sets internal parameters according to inputs, outputs and blobs. More...

void	finalize (const std::vector< Mat > &inputs, std::vector< Mat > &outputs)

std::vector< Mat >	finalize (const std::vector< Mat > &inputs)

virtual void	forward (std::vector< Mat *> &input, std::vector< Mat > &output, std::vector< Mat > &internals)=0
	Given the `input` blobs, computes the output `blobs`. More...

void	forward (const std::vector< Mat > &inputs, std::vector< Mat > &outputs, std::vector< Mat > &internals)

virtual int64	getFLOPS (const std::vector< MatShape > &inputs, const std::vector< MatShape > &outputs) const

virtual bool	getMemoryShapes (const std::vector< MatShape > &inputs, const int requiredOutputs, std::vector< MatShape > &outputs, std::vector< MatShape > &internals) const

virtual Ptr< BackendNode >	initHalide (const std::vector< Ptr< BackendWrapper > > &inputs)
	Returns Halide backend node. More...

void	run (const std::vector< Mat > &inputs, std::vector< Mat > &outputs, std::vector< Mat > &internals)
	Allocates layer and computes output. More...

virtual bool	setActivation (const Ptr< ActivationLayer > &layer)
	Tries to attach to the layer the subsequent activation layer, i.e. do the layer fusion in a partial case. More...

virtual bool	setBatchNorm (const Ptr< BatchNormLayer > &layer)
	Tries to attach to the layer the subsequent batch normalization layer, i.e. do the layer fusion in a partial case. More...

void	setParamsFrom (const LayerParams &params)
	Initializes only name, type and blobs fields. More...

virtual bool	setScale (const Ptr< ScaleLayer > &layer)
	Tries to attach to the layer the subsequent scaling layer, i.e. do the layer fusion in a partial case. More...

virtual bool	supportBackend (int backendId)
	Ask layer if it support specific backend for doing computations. More...

virtual Ptr< BackendNode >	tryAttach (const Ptr< BackendNode > &node)
	Implement layers fusing. More...

virtual void	unsetAttached ()
	"Deattaches" all the layers, attached to particular layer. More...

Public Member Functions inherited from cv::Algorithm
	Algorithm ()

virtual	~Algorithm ()

virtual void	clear ()
	Clears the algorithm state. More...

virtual bool	empty () const
	Returns true if the Algorithm is empty (e.g. in the very beginning or after unsuccessful read. More...

virtual String	getDefaultName () const

virtual void	read (const FileNode &fn)
	Reads algorithm parameters from a file storage. More...

virtual void	save (const String &filename) const

virtual void	write (FileStorage &fs) const
	Stores algorithm parameters in a file storage. More...

Static Public Member Functions
static Ptr< LSTMLayer >	create (const LayerParams &params)

Static Public Member Functions inherited from cv::Algorithm
template<typename _Tp >
static Ptr< _Tp >	load (const String &filename, const String &objname=String())
	Loads algorithm from the file. More...

template<typename _Tp >
static Ptr< _Tp >	loadFromString (const String &strModel, const String &objname=String())
	Loads algorithm from a String. More...

template<typename _Tp >
static Ptr< _Tp >	read (const FileNode &fn)
	Reads algorithm from the file node. More...

Additional Inherited Members
Public Attributes inherited from cv::dnn::Layer
std::vector< Mat >	blobs
	List of learned parameters must be stored here to allow read them by using Net::getParam(). More...

String	name
	Name of the layer instance, can be used for logging or other internal purposes. More...

String	type
	Type name which was used for creating layer by layer factory. More...

Protected Member Functions inherited from cv::Algorithm
void	writeFormat (FileStorage &fs) const

Detailed Description

LSTM recurrent layer.

Member Function Documentation

§ create()

static Ptr<LSTMLayer> cv::dnn::LSTMLayer::create ( const LayerParams & params )

static

Creates instance of LSTM layer

§ inputNameToIndex()

int cv::dnn::LSTMLayer::inputNameToIndex ( String inputName )

virtual

Returns index of input blob into the input array.

Parameters

inputName label of input blob

Each layer input and output can be labeled to easily identify them using "%<layer_name%>[.output_name]" notation. This method maps label of input blob to its index into input vector.

Reimplemented from cv::dnn::Layer.

§ outputNameToIndex()

int cv::dnn::LSTMLayer::outputNameToIndex ( String outputName )

virtual

Returns index of output blob in output array.

See also: inputNameToIndex()

Reimplemented from cv::dnn::Layer.

§ setOutShape()

virtual void cv::dnn::LSTMLayer::setOutShape ( const MatShape & outTailShape = MatShape() )

pure virtual

Specifies shape of output blob which will be [[T], N] + outTailShape.

If this parameter is empty or unset then outTailShape = [Wh.size(0)] will be used, where Wh is parameter from setWeights().

§ setProduceCellOutput()

virtual void cv::dnn::LSTMLayer::setProduceCellOutput ( bool produce = false )

pure virtual

If this flag is set to true then layer will produce \( c_t \) as second output.

Shape of the second output is the same as first output.

§ setUseTimstampsDim()

virtual void cv::dnn::LSTMLayer::setUseTimstampsDim ( bool use = true )

pure virtual

Specifies either interpet first dimension of input blob as timestamp dimenion either as sample.

If flag is set to true then shape of input blob will be interpeted as [T, N, [data dims]] where T specifies number of timpestamps, N is number of independent streams. In this case each forward() call will iterate through T timestamps and update layer's state T times.

If flag is set to false then shape of input blob will be interpeted as [N, [data dims]]. In this case each forward() call will make one iteration and produce one timestamp with shape [N, [out dims]].

§ setWeights()

virtual void cv::dnn::LSTMLayer::setWeights	(	const Mat &	Wh,
		const Mat &	Wx,
		const Mat &	b
	)

pure virtual

Set trained weights for LSTM layer. LSTM behavior on each step is defined by current input, previous output, previous cell state and learned weights.

Let \(x_t\) be current input, \(h_t\) be current output, \(c_t\) be current state. Than current output and current cell state is computed as follows:

\begin{eqnarray*} h_t &= o_t \odot tanh(c_t), \\ c_t &= f_t \odot c_{t-1} + i_t \odot g_t, \\ \end{eqnarray*}

where \(\odot\) is per-element multiply operation and \(i_t, f_t, o_t, g_t\) is internal gates that are computed using learned wights.

Gates are computed as follows:

\begin{eqnarray*} i_t &= sigmoid&(W_{xi} x_t + W_{hi} h_{t-1} + b_i), \\ f_t &= sigmoid&(W_{xf} x_t + W_{hf} h_{t-1} + b_f), \\ o_t &= sigmoid&(W_{xo} x_t + W_{ho} h_{t-1} + b_o), \\ g_t &= tanh &(W_{xg} x_t + W_{hg} h_{t-1} + b_g), \\ \end{eqnarray*}

where \(W_{x?}\), \(W_{h?}\) and \(b_{?}\) are learned weights represented as matrices: \(W_{x?} \in R^{N_h \times N_x}\), \(W_{h?} \in R^{N_h \times N_h}\), \(b_? \in R^{N_h}\).

For simplicity and performance purposes we use \( W_x = [W_{xi}; W_{xf}; W_{xo}, W_{xg}] \) (i.e. \(W_x\) is vertical contacentaion of \( W_{x?} \)), \( W_x \in R^{4N_h \times N_x} \). The same for \( W_h = [W_{hi}; W_{hf}; W_{ho}, W_{hg}], W_h \in R^{4N_h \times N_h} \) and for \( b = [b_i; b_f, b_o, b_g]\), \(b \in R^{4N_h} \).

Parameters

Wh	is matrix defining how previous output is transformed to internal gates (i.e. according to abovemtioned notation is \( W_h \))
Wx	is matrix defining how current input is transformed to internal gates (i.e. according to abovemtioned notation is \( W_x \))
b	is bias vector (i.e. according to abovemtioned notation is \( b \))

The documentation for this class was generated from the following file:

dnn/include/opencv2/dnn/all_layers.hpp

Public Member Functions

Static Public Member Functions

Additional Inherited Members

Detailed Description

Member Function Documentation

§ create()

§ inputNameToIndex()

§ outputNameToIndex()

§ setOutShape()

§ setProduceCellOutput()

§ setUseTimstampsDim()

§ setWeights()