Some comments:
1) IIRC a VBO is the OpenGL analog of a DirectX vertex buffer, so the same optimizations should apply.
2) If the vertex size gets bigger than 32 bytes, you suffer a performance drop. Have you tried 64-byte vertices?
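
To make the 32 vs. 64 byte idea concrete, here is a rough sketch of what the two layouts could look like. The struct names, the attribute set (tangent, color) and the hand-written padding are my own assumptions for illustration, not anything from your code; the only point is that the stride ends up at exactly 32 or 64 bytes.

```cpp
#include <cstddef>

// Hypothetical 32-byte vertex: position + normal + one texcoord set.
struct Vertex32 {
    float position[3];  // 12 bytes
    float normal[3];    // 12 bytes
    float texcoord[2];  //  8 bytes -> 32 total
};

// Hypothetical bigger vertex: the extra attributes bring the payload to
// 52 bytes, so it is padded by hand up to 64.
struct Vertex64 {
    float position[3];       // 12 bytes
    float normal[3];         // 12 bytes
    float texcoord[2];       //  8 bytes
    float tangent[4];        // 16 bytes
    unsigned char color[4];  //  4 bytes -> 52 bytes of real data
    unsigned char pad[12];   // 12 bytes of padding -> 64 total
};

// Fail at compile time if the compiler lays the structs out differently.
static_assert(sizeof(Vertex32) == 32, "Vertex32 should be exactly 32 bytes");
static_assert(sizeof(Vertex64) == 64, "Vertex64 should be exactly 64 bytes");
```

You would then pass `sizeof(Vertex32)` or `sizeof(Vertex64)` as the stride when setting up the vertex pointers. Whether the padded 64-byte version is actually faster than an unpadded one will depend on the hardware and driver, so it needs to be measured.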