geemili/drm - drm - Forgejo: Beyond coding. We Forge.

Commit Graph

Author	SHA1	Message	Date
Daniel Vetter	e6018c25ca	intel: Fixup for the fix for relaxed tiling on gen2 This is Fail. First patch to libdrm, and I've borked it up. Noticed-by: Chris Wilson <chris@chris-wilson.co.uk> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-02-22 19:11:33 +01:00
Daniel Vetter	9a71ed93f4	intel: fix relaxed tiling on gen2 Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-02-22 18:53:56 +01:00
Chris Wilson	36d4939343	intel: Remember named bo ... and if asked to open a bo by the same global name, return a fresh reference to the previously allocated buffer. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-14 10:18:39 +00:00
Chris Wilson	53581b6210	intel: Set the public handle after opening by name Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-14 10:15:31 +00:00
Chris Wilson	550fe2ca3b	intel: compile fix for previous commit after rebasing Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-01-12 15:06:25 +00:00
Chris Wilson	6717b7579f	intel: Fallback to old exec if no mrb_exec is available Reported-by: Torsten Hilbrich <torsten.hilbrich@secunet.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=33016 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-01-12 11:00:13 +00:00
Chris Wilson	0184bb1c6d	intel: Export CONSTANT_BUFFER addressing mode Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-19 13:01:15 +00:00
Chris Wilson	537703fd48	intel: Reorder need_fence vs fenced_command to avoid fences on gen4 gen4+ hardware doesn't use fences for GPU access and the older kernel doesn't expect userspace to make such a mistake. So don't. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=32190 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-07 20:34:22 +00:00
Chris Wilson	af3d282afb	intel: If the command is fenced inform the kernel ... but only account for a fenced used if the object is tiled. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-03 10:51:49 +00:00
Chris Wilson	1443bea488	intel: Add a forward declaration of struct drm_clip_rect ... so that intel_bufmgr.h can be compiled standalone. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-25 16:59:20 +00:00
Chris Wilson	51b895041c	intel: Compute in-aperture size for relaxed fenced objects For relaxed fencing the object may only consume the small set of active pages, but still requires a fence region once bound into the aperture. This is the size we need to use when computing the maximum possible aperture space that could be used by a single batchbuffer and so avoid hitting ENOSPC. Reported-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-22 09:54:47 +00:00
Eric Anholt	877b2ce15b	intel: Fix drm_intel_gem_bo_wait_rendering to wait for read-only usage too. Both the consumers of this API (sync objects and client throttling) were expecting this behavior. The kernel used to actually behave the desired (but incorrect) way for us anyway, but that got fixed a while back.	2010-11-09 13:57:19 -08:00
Albert Damen	49447a9b95	intel: initialize bufmgr.bo_mrb_exec unconditionally If bufmgr.bo_mrb_exec is not set, drm_intel_bo_mrb_exec returns ENODEV even though drm_intel_gem_bo_mrb_exec2 will work fine for the RENDER ring. Fixes xf86-video-intel after commit 'add BLT ring support' (5bed685f76) with kernels without BSD or BLT ring support (2.6.34 and before). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=31443 Signed-off-by: Albert Damen <albrt@gmx.net> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-07 14:59:55 +00:00
Eric Anholt	a52e61b5c8	intel: Drop silly asserts on mappings present at unmap time. The intent of these was to catch mismatched map/unmap. What it actually did was check whether there was ever a mapping of that type (including in a previous life of the buffer through the userland BO cache), not whether they were mismatched. We don't even actually want to catch mismatched map/unmap, unless we also do refcounting, since at one point Mesa would do map/map/use/unmap/unmap. Just remove this code instead.	2010-11-02 11:32:32 -07:00
Eric Anholt	4abb65f95c	intel: Remove gratuitous assert on bo_reference. This couldn't be triggered except by overflow, since there's an assert in unreference to catch the usual failure of over-unreferencing.	2010-11-02 11:19:21 -07:00
Eric Anholt	f45305c1aa	intel: Shove the fake bufmgr subdata implementation into the fake bufmgr.	2010-11-01 06:54:58 -07:00
Eric Anholt	6560b4766c	intel: Remove stale comment.	2010-11-01 06:50:04 -07:00
Chris Wilson	362457715f	intel: enable relaxed fence allocation for i915 The kernel has always allowed userspace to underallocate objects supplied for fencing. However, the kernel only allocated the object size for the fence in the GTT and so caused tiling corruption. More recently the kernel does allocate the full fence region in the GTT for an under-sized object and so advertises that clients may finally make use of this feature. The biggest benefit is for texture-heavy GL games on i945 such as World of Padman which go from needing over 1GiB of RAM to play to fitting in the GTT! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-29 10:49:54 +01:00
Chris Wilson	057fab3382	intel: Prepare for BLT ring split. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-26 11:41:40 +01:00
Chris Wilson	96214860bb	intel: Downgrade error warnings to debug As the higher layers check the error return from libdrm-intel and are supposed to handle the error (and print their own warning in extremis) the voluminous output on stderr is just noise and a hazard in its own right. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-01 16:42:38 +01:00
Chris Wilson	6299722c47	intel: Replace open-coded drmIoctl with calls to drmIoctl() Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-09-25 21:38:31 +01:00
Eric Anholt	23287f05cf	Avoid use of c++ reserved keyword "virtual" when using a C++ compiler. Avoids requiring nasty hacks around libdrm headers in the new C++ parts of Mesa drivers.	2010-08-26 15:45:12 -07:00
Chris Wilson	c3ddfea1a6	intel: Suppress the error return from setting domains after mapping. If the mapping succeeds we have a valid pointer. If setting the domain failures we may incur cache corruption. However the usual failure mode is because of a hung GPU, in which case it is preferable to ignore the minor error from setting the domain and continue on oblivious. If these errors persist, we should rate limit the warning [or even just remove it]. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-29 20:16:56 +01:00
Chris Wilson	726210f87d	intel: Limit tiled pitches to 8192 on pre-i965. Fixes: Bug 28515 - Failed to allocate framebuffer when exceed 2048 width https://bugs.freedesktop.org/show_bug.cgi?id=28515 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-24 11:38:00 +01:00
Chris Wilson	6ea2bda5f5	intel: Only adjust the local stride used for SET_TILING in tiled alloc Mesa uses the returned pitch from alloc_tiled, so make sure that we set it correctly before modifying the stride used for the SET_TILING call. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 13:03:52 +01:00
Chris Wilson	aba3502190	intel: Restore SET_TILING for non-flinked bo. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 13:00:22 +01:00
Chris Wilson	c7bbaca6a3	intel: '===' != '==' Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 11:15:56 +01:00
Chris Wilson	cd34cbeb9f	intel: Sanitise strides for linear buffers and SET_TILING Ensure that the user doesn't attempt to specify a stride to use with a linear buffer by forcing such to be zero. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 11:09:12 +01:00
Chris Wilson	13e8270504	intel: Print out debugging message following ENOSPC execbuffer() returns ENOSPC if it cannot fit the batch buffer into the aperture which is the error we want to diagnose here. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	f16b4164d6	intel: Scan the cache for old bo once every second. Rearrange the cache cleanup so that we always scan following a final unreference, and guard against multiple scans in a single second. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	5eec286838	intel: Force stride to be 0 for I915_TILING_NONE. When allocating a tiled buffer, if we remove the desired tiling mode due to it being beyond hardware limits, also remove the stride. This ensures that we only ever use stride 0 with I915_TILING_NONE. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	1db22ff741	intel: Defer tiling change to allocation. As we now expose a method to allocate tiled buffers, it makes more sense to defer the SET_TILING until required. Besides the slim chance that it will be a no-op, by delaying the change we are less likely to stall on waiting for a bound buffer to release a fence register. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	056aa9be04	intel: Track tiling stride We need to inform the kernel if the tiling stride changes and not only for changes of the tiling mode. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 14:25:39 +01:00
Eric Anholt	4f7704aea7	intel: Fix several other paths for buffers pointing at themselves.	2010-06-10 09:02:14 -07:00
Eric Anholt	0ec768e67a	intel: Add more intermediate sizes of cache buckets between powers of 2. We had two cases recently where the rounding to powers of two hurt badly: 4:2:0 YUV HD video frames would round up from 2.2MB to 4MB, and Urban Terror was hitting aperture size limitations. For UT, this is because mipmap trees for power of two texture sizes will land right in the middle between two cache buckets. By giving a few more sizes between powers of two, Urban Terror on my 945 ends up consuming 207MB of GEM objects instead of 272MB, and HD video decode on Ironlake goes from 99MB to 75MB. cairo-perf-diff of the benchmarks for gl and xlib shows a 1.09x and 1.06x speedup and a 1.07x, 1.08x, and 1.11x slowdown. From this, I think this patch was really a no-op in terms of performance for these CPU-bound workloads.	2010-06-10 08:56:56 -07:00
Chris Wilson	e65caeba9e	intel: Convert to untiled pitches if surface is too large for tiling. If the pitch is too large for the hardware to tile, recompute the required surface size based on the untiled pitch and alignments. For the older hardware, which has smaller limits and greater restrictions, this may be a considerable saving in allocation size. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-09 10:18:31 +01:00
Eric Anholt	f179137f8f	Allow a buffer to point at itself and still get relocs. I'm using this in experiments with the i965 Mesa driver.	2010-06-07 17:29:57 -07:00
Zou Nan hai	66375fd6e8	intel: Add support for kernel multi-ringbuffer API. This introduces a new API to exec on BSD ring buffer, for H.264 VLD decoding. Signed-off-by: Xiang Hai hao <haihao.xiang@intel.com> Signed-off-by: Zou Nan hai <nanhai.zou@intel.com>	2010-06-06 15:50:38 -07:00
Eric Anholt	58e54f62c9	intel_bufmgr_fake: fix compile warning.	2010-05-26 12:10:39 -07:00
Chris Wilson	fcf3e616ee	intel: Don't change tiling mode unless the kernel reports success. Fixes: Bug 26686 - Some textures are distorted with libdrm 2.4.18 in GTAVC&GTA3 http://bugs.freedesktop.org/show_bug.cgi?id=26686 This bug continues to haunt me. The kernel SET_TILING ioctl is inconsistent in its return values when reporting an error. If one of its sanity checks fail, then the input values are left unchanged. If the kernel later fails to change the tiling mode, then the input values are modified to match the current tiling on the object. In short, userspace cannot trust the return values upon error and so we must assume that upon error our current tiling mode matches reality and not update.	2010-05-24 18:38:29 +01:00
Chris Wilson	a3305b076c	Revert "intel: We don't need to take the bufmgr lock whilst mapping." This reverts commit `7ca558494d`. This was pushed ahead of an essential review of bo level locking in mesa, without which we cannot know whether removing this lock is safe.	2010-05-13 08:25:56 +01:00
Chris Wilson	07e7589d86	intel: query whether a buffer is reusable. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-11 08:54:06 +01:00
Chris Wilson	7ca558494d	intel: We don't need to take the bufmgr lock whilst mapping.	2010-05-06 22:56:41 +01:00
Chris Wilson	3506173ba7	intel: Use the correct size when allocating reloc_target_info array Thomas tracked down this error with kdm and commit b509640: ==4320== Invalid write of size 8 ==4320== at 0x9A97998: do_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0x9A97B9C: drm_intel_gem_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0xAED3234: intel_batchbuffer_emit_reloc (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF13827: brw_emit_vertices (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF1F14D: brw_upload_state (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF12122: brw_draw_prims (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xB256824: vbo_exec_vtx_flush (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0xB2523BB: vbo_exec_FlushVertices_internal (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0xB252411: vbo_exec_FlushVertices (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0xB195A3D: _mesa_PopAttrib (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0x8DF0F02: __glXDisp_Render (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x8DF517F: __glXDispatch (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== Address 0x126a8b80 is 0 bytes after a block of size 16,368 alloc'd ==4320== at 0x4C23E03: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==4320== by 0x9A97A64: do_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0x9A97B9C: drm_intel_gem_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0xAED3234: intel_batchbuffer_emit_reloc (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF191DB: upload_binding_table_pointers (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF1F14D: brw_upload_state (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF12122: brw_draw_prims (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xB255EF6: vbo_exec_DrawArrays (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0x8DF67A3: __glXDisp_DrawArrays (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x8DF0F02: __glXDisp_Render (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x8DF517F: __glXDispatch (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x446293: ??? (in /usr/bin/Xorg) which is simply due to only allocating space for the pointers and not the structs themselves. D'oh. Reported-by: Thomas Bächler <thomas@archlinux.org> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-04-11 18:48:31 +01:00
Eric Anholt	ef36c9a3b2	intel: Install the header file in the libdrm/ directory. Suggested-by: Rémi Cardona <remi@gentoo.org> Signed-off-by: Eric Anholt <eric@anholt.net>	2010-03-17 12:49:10 -07:00
Julien Cristau	d271336925	libdrm_intel.pc: don't include ${includedir}/drm intel_bufmgr.h is installed in ${includedir} directly, and the other headers are taken care of by libdrm.pc's Cflags. Signed-off-by: Julien Cristau <jcristau@debian.org> Signed-off-by: Eric Anholt <eric@anholt.net>	2010-03-17 12:45:23 -07:00
Eric Anholt	7c697b1670	intel: Align untiled buffer pitch to 64B. This is the largest untiled pitch requirement from gen2 through gen4. It's only the case for gen3 rendering to color regions with depth, but it's rare for this to be a significant factor in memory usage -- for example, gen4 requires 1 or 2 times the element size, or up to 64 bytes depending on the size of the elements. This is easier than encoding all the various little quirks for untiled pitch alignment, since we rarely do untiled now.	2010-03-17 11:15:45 -07:00
Pauli Nieminen	21105bc186	libdrm: Move intel_atomic.h to libdrm core for sharing. intel_atomic.h includes very usefull atomic operations for lock free parrallel access of variables. Moving these to core libdrm for code sharing with radeon. Signed-off-by: Pauli Nieminen <suokkos@gmail.com>	2010-03-17 11:48:00 +02:00
Chris Wilson	a4041e096c	intel: Repeat execbuffer if interrupted by signal Repeat while EINTR, not EAGAIN! One more source of corruption erradicated, hurray! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-03-07 14:17:24 +00:00
Eric Anholt	1d4d1e6b13	intel: Only align Y-tiling pitch to the Y tile width. Fixes piglit depth-tex-modes on gen4.	2010-03-04 16:27:45 -08:00

1 2

79 Commits (fd3ed34a2070fca3804baf54ece40d0bc2666226)