After finishing my projects, I finally have time to work on some way long overdue features.
I'm doing a lot of work on my user interface revamp, and also doing a lot of code cleanup since now Windows Phone gives us access (also long overdue) to all nice features like P/Invoke, Direct2d....
This means after a bit of tweaking on SharpDX code base, windows phone is now AnyCpu, which is really great, having x86, x64, ARM and lot of targets to maintain is not pleasing at all.
Also I decided to move on and remove the old DirectX11/DirectX11.1 targets on my toolbox. No more Windows 7 support, I prefer to focus on having a better build for Desktop / RT / Phone instead.
Now after all this I thought it was time to work on another (well overdue as well) feature.
I've always been interested into processing geometry in compute shaders instead of using StreamOut.
This gives some pretty useful advantages:
Pretty simple, with two things to note:
I'm doing a lot of work on my user interface revamp, and also doing a lot of code cleanup since now Windows Phone gives us access (also long overdue) to all nice features like P/Invoke, Direct2d....
This means after a bit of tweaking on SharpDX code base, windows phone is now AnyCpu, which is really great, having x86, x64, ARM and lot of targets to maintain is not pleasing at all.
Also I decided to move on and remove the old DirectX11/DirectX11.1 targets on my toolbox. No more Windows 7 support, I prefer to focus on having a better build for Desktop / RT / Phone instead.
Now after all this I thought it was time to work on another (well overdue as well) feature.
I've always been interested into processing geometry in compute shaders instead of using StreamOut.
This gives some pretty useful advantages:
- Write in place : RWStructuredBuffer rocks, no need for ping/pong
- Writing Indexed geometry using Geometry Shader is doable, but rather painful.
- Much easier to do more advanced post processing
- Access to indirect Draw, which means it's much easier to instance generated geometry, StreamOut does not give us access to internal counter, except using a query, eg: not ideal.
Here are my structures:
Code Snippet
- private DX11StructuredBuffer positionbuffer;
- private DX11StructuredBuffer normalsbuffer;
- private DX11StructuredBuffer uvbuffer;
- private DX11StructuredBuffer appendindexbuffer;
- private DX11InstancedIndexedDrawer drawer;
- private DispatchIndirectBuffer vertexIndirectDispatch;
- private DispatchIndirectBuffer indexIndirectDispacth;
Pretty simple, with two things to note:
- Position buffer is created with the counter flag
- Index Buffer is created with the append flag
Now there are a few bits to consider:
- Structured Buffer is not bindable as Vertex Buffer: This is more or less a non issue in my case, since most of my shaders already fetch data from StructuredBuffers using SV_VertexID
- StructuredBuffer is not bindable as Index Buffer. This is more annoying, since you need to bind it to the pipeline, no way to fetch. This is quite easy to sort still, as you can just create a standard index buffer (which allow raw view and give it UAV access), then just use a compute shader to copy indices.
So with this setup we are more or less set, you can process per vertex using VertexIndirectDispatcher, process per face using IndexIndirectDispatcher, and since you have access to counter/append, you can also easily emit/remove Geometry.
Now one main issue with this setup, you need to feed your buffers with some initial geometry.
You have a very simple way to do this in DirectX11.1 , but the feature is only supported on ATI.
So instead, let's replace my old monolithic geometry builders by something more flexible
For now my geometry builders look like this:
Code Snippet
- public DX11IndexedGeometry QuadNormals(Quad settings)
- {
- DX11IndexedGeometry geom = new DX11IndexedGeometry(device);
- geom.Tag = settings;
- geom.PrimitiveType = settings.PrimitiveType;
- List<Pos4Norm3Tex2Vertex> vertices = new List<Pos4Norm3Tex2Vertex>();
- List<int> indices = new List<int>();
- //Build your array
- geom.VertexBuffer = DX11VertexBuffer.CreateImmutable(device, vertices.ToArray());
- geom.IndexBuffer = DX11IndexBuffer.CreateImmutable(device, indices.ToArray());
- geom.InputLayout = Pos4Norm3Tex2Vertex.Layout;
- geom.VertexBuffer.InputLayout = geom.InputLayout;
- geom.Topology = PrimitiveTopology.TriangleList;
- geom.HasBoundingBox = true;
- return geom;
- }
This has a bit too many things to do:
The big problem is that all is in one function, so let's improve and decouple this a little bit
First geometry builders do some pretty simple things, append vertices and append faces, so let's change the construction code like this.
- Read settings
- Build Vertices/Indices list
- Create Geometry resource
The big problem is that all is in one function, so let's improve and decouple this a little bit
First geometry builders do some pretty simple things, append vertices and append faces, so let's change the construction code like this.
Code Snippet
- public class SegmentBuilder : IGeometryBuilder<Segment>
- {
- public PrimitiveInfo GetPrimitiveInfo(Segment settings)
- {
- float phase = settings.Phase;
- float cycles = settings.Cycles;
- float inner = settings.InnerRadius;
- int res = settings.Resolution;
- bool flat = settings.Flat;
- int vcount = res * 2;
- int icount = (res - 1) * 6;
- return new PrimitiveInfo(vcount, icount);
- }
- public void Construct(Segment settings, Action<Vector3, Vector3, Vector2> appendVertex, Action<Int3> appendIndex)
- {
- }
- }
Now we have one builder class responsible to build geometry, but it has no idea of which data structure we use anymore, which is great, since now it's very easy to:
Now to construct our geometry, we can simply do:
What is great by passing functions instead of interfaces, it's much easier to composite actions, for example, we can easily compute bounding box while we create our List.
We can also attach several Appenders, like:
- Use different data back end (DataStream, List...)
- Pack data the way we want
- Completely bypass something (if resolution parameter does not change for example, we can set appendIndex function to null, so we completely ignore building faces).
Here is a simple example to append data into lists:
Code Snippet
- public class ListGeometryAppender
- {
- private List<Pos4Norm3Tex2Vertex> vertices = new List<Pos4Norm3Tex2Vertex>();
- private List<int> indices = new List<int>();
- public List<Pos4Norm3Tex2Vertex> Vertices { get { return this.vertices; } }
- public List<int> Indices { get { return this.indices; } }
- public void AppendVertex(Vector3 position, Vector3 normal, Vector2 uv)
- {
- Pos4Norm3Tex2Vertex v = new Pos4Norm3Tex2Vertex()
- {
- Position = new Vector4(position.X, position.Y, position.Z, 1.0f),
- Normals = normal,
- TexCoords = uv
- };
- this.vertices.Add(v);
- }
- public void AppendIndex(Int3 index)
- {
- this.indices.Add(index.X);
- this.indices.Add(index.Y);
- this.indices.Add(index.Z);
- }
- public void TransformVertices(Func<Pos4Norm3Tex2Vertex,Pos4Norm3Tex2Vertex> transformFunction)
- {
- for (int i = 0; i < this.vertices.Count; i++)
- {
- Pos4Norm3Tex2Vertex v = vertices[i];
- vertices[i] = transformFunction(v);
- }
- }
- }
Now to construct our geometry, we can simply do:
Code Snippet
- public partial class PrimitivesManager
- {
- public DX11IndexedGeometry Cylinder(Cylinder settings)
- {
- CylinderBuilder builder = new CylinderBuilder();
- ListGeometryAppender appender = new ListGeometryAppender();
- PrimitiveInfo info = builder.GetPrimitiveInfo(settings);
- builder.Construct(settings, appender.AppendVertex, appender.AppendIndex);
- return FromAppender(settings, appender, info);
- }
- private DX11IndexedGeometry FromAppender(AbstractPrimitiveDescriptor descriptor, ListGeometryAppender appender, PrimitiveInfo info)
- {
- DX11IndexedGeometry geom = new DX11IndexedGeometry(device);
- geom.Tag = descriptor;
- geom.PrimitiveType = descriptor.PrimitiveType;
- geom.VertexBuffer = DX11VertexBuffer.CreateImmutable(device, appender.Vertices.ToArray());
- geom.IndexBuffer = DX11IndexBuffer.CreateImmutable(device, appender.Indices.ToArray());
- geom.InputLayout = Pos4Norm3Tex2Vertex.Layout;
- geom.Topology = PrimitiveTopology.TriangleList;
- geom.HasBoundingBox = info.IsBoundingBoxKnown;
- geom.BoundingBox = info.BoundingBox;
- return geom;
- }
- }
What is great by passing functions instead of interfaces, it's much easier to composite actions, for example, we can easily compute bounding box while we create our List.
Code Snippet
- SegmentBuilder builder = new SegmentBuilder();
- ListGeometryAppender appender = new ListGeometryAppender();
- PrimitiveInfo info = builder.GetPrimitiveInfo(settings);
- Vector3 max = new Vector3(float.MinValue, float.MinValue, float.MinValue);
- Vector3 min = new Vector3(float.MaxValue, float.MaxValue, float.MaxValue);
- builder.Construct(settings, (v, n, u) =>
- { appender.AppendVertex(v, n, u); min = Vector3.Min(min, v); max = Vector3.Max(max, v); }, appender.AppendIndex);
We can also attach several Appenders, like:
Code Snippet
- SegmentBuilder builder = new SegmentBuilder();
- ListGeometryAppender appender = new ListGeometryAppender();
- MultiListGeometryAppender multilist = new MultiListGeometryAppender();
- builder.Construct(settings, (v, n, u) =>
- {
- appender.AppendVertex(v, n, u);
- multilist.AppendVertex(v, n, u);
- }
- ,appender.AppendIndex);
And you noticed we can optimize by only filling one index buffer ;)
To build StructuredBuffers instead of geometry, here we are:
Code Snippet
- SegmentBuilder builder = new SegmentBuilder();
- MultiListGeometryAppender appender = new MultiListGeometryAppender();
- builder.Construct(settings, appender.AppendVertex, appender.AppendIndex);
- DX11StructuredBuffer intialposition = DX11StructuredBuffer.CreateImmutable<Vector3>(device, appender.Positions.ToArray());
- DX11StructuredBuffer initialindices = DX11StructuredBuffer.CreateImmutable<Int3>(device, appender.Indices.ToArray());
Also, we can write to a dynamic buffer directly:
Code Snippet
- SegmentBuilder builder = new SegmentBuilder();
- PrimitiveInfo info = builder.GetPrimitiveInfo(settings);
- DX11StructuredBuffer dynamicposition = DX11StructuredBuffer.CreateDynamic<Vector3>(device, info.VerticesCount);
- DataStream ds = dynamicposition.MapForWrite(context);
- builder.Construct(settings, (p,n,u) => ds.Write(p) , null);
- dynamicposition.Unmap(context);
Here we are, next post I'll show a few examples of what we can now do with compute geometry (and some new idea for particle system ;)