from examples.cfd import plot_field, init_hat
import numpy as np
%matplotlib inline
# Some variable declarations
= 50
nx = 50
ny = 100
nt = 0.
xmin = 2.
xmax = 0.
ymin = 1.
ymax
= (xmax - xmin) / (nx - 1)
dx = (ymax - ymin) / (ny - 1)
dy
Example 6: Poisson equation
In the previous example we have seen how to solve the steady-state Laplace equation with a prescribed boundary condition and BCs that contain derivatives (Neumann BC). For this we used a pseudo-timestepping to illustrate the use of two explicit data symbols of type Function
to derive at a steady state. In this example we are going to solve the Poisson equation (step 10 of the original tutorial series) with an initial condition to demonstrate how we can combine time-dependent functions of type TimeFunction
with time-constant Function
symbols. We will again implement a Python-driven outer loop for the diffusion kernel, before demonstrating how we can change the expression defining our operator to internalize the pseudo-timestepping and thus get the full performance advantage of compiled kernel loops.
Our governing equation is the Poisson equation with a source term as the RHS: \[\frac{\partial ^2 p}{\partial x^2} + \frac{\partial ^2 p}{\partial y^2} = b\]
Discretizing and rearranging our stencil expression yields \[p_{i,j}^{n}=\frac{(p_{i+1,j}^{n}+p_{i-1,j}^{n})\Delta y^2+(p_{i,j+1}^{n}+p_{i,j-1}^{n})\Delta x^2-b_{i,j}^{n}\Delta x^2\Delta y^2}{2(\Delta x^2+\Delta y^2)}\]
The key difference to the Laplace example in the previous tutorial is that this example has simple Dirichlet boundary conditions \(p = 0\) on all boundaries and is instead driven by the source term \(b\) defined as
\(b_{i,j}=100\) at \(i=nx/4,\ j=ny/4\)
\(b_{i,j}=-100\) at \(i=3/4 nx,\ j=3/4 ny\)
\(b_{i,j}=0\) everywhere else.
And of course, we will demonstrate the original example first using pure NumPy notation.
# Initialization
= np.zeros((nx, ny))
p = np.zeros((nx, ny))
pd = np.zeros((nx, ny))
b
# Source
int(nx / 4), int(ny / 4)] = 100
b[int(3 * nx / 4), int(3 * ny / 4)] = -100 b[
%%time
#NBVAL_IGNORE_OUTPUT
for it in range(nt):
= p.copy()
pd 1:-1,1:-1] = (((pd[1:-1, 2:] + pd[1:-1, :-2]) * dy**2 +
p[2:, 1:-1] + pd[:-2, 1:-1]) * dx**2 -
(pd[1:-1, 1:-1] * dx**2 * dy**2) /
b[2 * (dx**2 + dy**2)))
(
0, :] = 0
p[-1, :] = 0
p[nx0] = 0
p[:, -1] = 0
p[:, ny
CPU times: user 4.24 ms, sys: 0 ns, total: 4.24 ms
Wall time: 3.57 ms
#NBVAL_IGNORE_OUTPUT
=xmax, ymax=ymax, view=(30, 225)) plot_field(p, xmax
We can now pretty much use our previous implementation, although we will use pd
instead of pn
for consistency with the original. Our boundary conditions are even simpler since they are \(0\) everywhere. Our source term is neatly wrapped in the symbol b
and we can again use a Python-driven timestepping loop with switching buffers to repeatedly execute the operator we create.
from devito import Grid, Function, TimeFunction, Operator, configuration, Eq, solve
# Silence the runtime performance logging
'log-level'] = 'ERROR'
configuration[
= Grid(shape=(nx, ny), extent=(xmax, ymax))
grid = Function(name='p', grid=grid, space_order=2)
p = Function(name='pd', grid=grid, space_order=2)
pd = 0.
p.data[:] = 0.
pd.data[:]
# Initialise the source term `b`
= Function(name='b', grid=grid)
b = 0.
b.data[:] int(nx / 4), int(ny / 4)] = 100
b.data[int(3 * nx / 4), int(3 * ny / 4)] = -100
b.data[
# Create Laplace equation base on `pd`
= Eq(pd.laplace, b, subdomain=grid.interior)
eq # Let SymPy solve for the central stencil point
= solve(eq, pd)
stencil # # Now we let our stencil populate our second buffer `p`
= Eq(p, stencil)
eq_stencil
# Create boundary condition expressions
= grid.dimensions
x, y = grid.stepping_dim
t = [Eq(p[x, 0], 0.)]
bc += [Eq(p[x, ny-1], 0.)]
bc += [Eq(p[0, y], 0.)]
bc += [Eq(p[nx-1, y], 0.)]
bc
# Now we can build the operator that we need
= Operator([eq_stencil] + bc) op
%%time
#NBVAL_IGNORE_OUTPUT
# Run the outer loop explicitly in Python
for i in range(nt):
# Determine buffer order
if i % 2 == 0:
= p
_p = pd
_pd else:
= pd
_p = p
_pd
# Apply operator
=_p, pd=_pd) op(p
CPU times: user 73.1 ms, sys: 9.91 ms, total: 83 ms
Wall time: 84 ms
#NBVAL_IGNORE_OUTPUT
# Plot result
=xmax, ymax=ymax, view=(30, 225)) plot_field(p.data, xmax
Nice, we get the same spikes as the pure NumPy implementation. But we still drive the time loop though Python, which is slow. So let’s try and utilise Devito’s time-stepping mechanism that we saw earlier for this “pseudo-timestepping”.
The trick here is to again use TimeFunction
symbols, despite the lack of a time-dependence. This will cause Devito to allocate two grid buffers instead of one, which we can directly address them via the terms p
and p.forward
. This will now create an internal time loop in the kernel that we can again control by supplying the time
argument to the create operator during invocation.
One caveat, however, is that a TimeFunction
symbol has an implicit time dimension t
, so the symbol is denoted as p(t, x, y)
. This entails that, when creating our boundary condition expressions, we now need honour the leading time dimension in the “low-level” notation, and ensure that we update the index t + 1
(the equivalent to u.forward
).
# Now with Devito we will turn `p` into `TimeFunction` object
# to make all the buffer switching implicit
= TimeFunction(name='p', grid=grid, space_order=2)
p = 0.
p.data[:]
# Initialise the source term `b`
= Function(name='b', grid=grid)
b = 0.
b.data[:] int(nx / 4), int(ny / 4)] = 100
b.data[int(3 * nx / 4), int(3 * ny / 4)] = -100
b.data[
# Create Laplace equation base on `p`
= Eq(p.laplace, b)
eq # Let SymPy solve for the central stencil point
= solve(eq, p)
stencil # Let our stencil populate the buffer `p.forward`
= Eq(p.forward, stencil)
eq_stencil
# Create boundary condition expressions
# Note that we now add an explicit "t + 1" for the time dimension.
= [Eq(p[t + 1, x, 0], 0.)]
bc += [Eq(p[t + 1, x, ny-1], 0.)]
bc += [Eq(p[t + 1, 0, y], 0.)]
bc += [Eq(p[t + 1, nx-1, y], 0.)] bc
#NBVAL_IGNORE_OUTPUT
'log-level'] = 'ERROR'
configuration[# Create and execute the operator for a number of timesteps
= Operator([eq_stencil] + bc)
op %time op(time=nt)
CPU times: user 19 ms, sys: 3.28 ms, total: 22.2 ms
Wall time: 24.2 ms
PerformanceSummary([(PerfKey(name='section0', rank=None),
PerfEntry(time=0.0001769999999999997, gflopss=0.0, gpointss=0.0, oi=0.0, ops=0, itershapes=[])),
(PerfKey(name='section1', rank=None),
PerfEntry(time=4e-06, gflopss=0.0, gpointss=0.0, oi=0.0, ops=0, itershapes=[]))])
#NBVAL_IGNORE_OUTPUT
0], xmax=xmax, ymax=ymax, view=(30, 225)) plot_field(p.data[