geemili/drm - drm - Forgejo: Beyond coding. We Forge.

Commit Graph

Author	SHA1	Message	Date
Rob Clark	faf26b689d	intel: fix build error CCLD test_decode ./.libs/libdrm_intel.so: undefined reference to `drmPrimeHandleToFD' ./.libs/libdrm_intel.so: undefined reference to `drmPrimeFDToHandle' collect2: ld returned 1 exit status From Adam Jackson's explaination: most distros have changed it so ld defaults to --no-copy-dt-needed-entries, so if you use something from libdrm you can't just assume libdrm_intel will bring it in for you, you have to be explicit Signed-off-by: Rob Clark <rob@ti.com>	2012-07-20 10:28:46 -05:00
Dave Airlie	ff65de9666	intel: add prime interface for getting/setting a prime bo. (v4) This adds interfaces for the X driver to use to create a prime handle from a buffer, and create a bo from a handle. v2: use Chris's suggested naming (well from at least for consistency) v3: git commit --amend fail v4: fix as per Chris's suggestions, group assignments, add get tiling Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk> Signed-off-by: Dave Airlie <airlied@redhat.com>	2012-07-20 08:53:03 +10:00
Kenneth Graunke	992e2afd59	intel: Change context create failure message to from fprintf to DBG(). Since there is no getparam for hardware context support, Mesa always tries to obtain a context by calling drm_intel_gem_context_create and NULL-checking the result. On an older kernel without context support, this caused libdrm to print an unwanted message to stderr: DRM_IOCTL_I915_GEM_CONTEXT_CREATE failed: Invalid argument In fact, this caused every Piglit test to fail with a "warn" status due to the unrecognized error message. Change the message to use DBG() rather than fprintf(), so people can still get the debug message, but it won't spam normally. Reviewed-by: Ben Widawsky <ben@bwidawsk.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-07-12 13:44:21 -07:00
Lauri Kasanen	8fad46b614	intel: Fix build failure in test_decode.c Hi list The recently released libdrm 2.4.37 does not compile the Intel part: test_decode.c: In function 'compare_batch': test_decode.c:107: error: implicit declaration of function 'open_memstream' PS: Please CC me. Signed-off-by: Lauri Kasanen <cand@gmx.com> Signed-off-by: Ben Widawsky <ben@bwidawsk.net>	2012-07-03 09:54:41 -07:00
Ben Widawsky	f7210fa4ec	intel/context: create/destroy implementation Add relevant code to set up minimal state and call the appropriate kernel IOCTLs. This was missed in the previous cherry-picking for 2.3.36. Signed-off-by: Ben Widawsky <ben@bwidawsk.net>	2012-06-29 11:05:47 -07:00
Ben Widawsky	a7805194b7	intel/decode: fix the reference file for I mistakenly "fixed" a bad decode with commit `7d0a1d5ebb` Author: Ben Widawsky <ben@bwidawsk.net> Date: Sun Jun 24 20:35:57 2012 -0700 intel/decode: VERTEX_ELEMENT_STATE, 1 means valid However the actual fix is just to update the reference file, and include GEN7 in the decode. Props to Eric Anholt for putting the test in distcheck, or else I wouldn't have caught this. Signed-off-by: Ben Widawsky <ben@bwidawsk.net>	2012-06-27 10:22:37 -07:00
Ben Widawsky	df5cc69dab	Revert "intel/decode: VERTEX_ELEMENT_STATE, 1 means valid" This reverts commit `7d0a1d5ebb`. The actual fix	2012-06-27 10:21:42 -07:00
Ben Widawsky	ee2be4fa7e	intel: add decoding of MI_SET_CONTEXT Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-27 09:57:17 -07:00
Ben Widawsky	3ed38714bd	intel/context: new execbuf interface for contexts To support this we extract the common execbuf2 functionality to be called with, or without contexts. The context'd execbuf does not support some of the dri1 stuff. Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-27 09:57:17 -07:00
Ben Widawsky	b3b123dfb1	intel/context: Add drm_intel_context type Add an opaque type representing a HW context. Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-27 09:57:16 -07:00
Ben Widawsky	7d0a1d5ebb	intel/decode: VERTEX_ELEMENT_STATE, 1 means valid The logic seemed to be inverse to me. Signed-off-by: Ben Widawsky <ben@bwidawsk.net>	2012-06-27 09:58:35 -07:00
Ben Widawsky	eeedb0dcc4	intel/decode: add sampler state pointers for [HD]S Signed-off-by: Ben Widawsky <ben@bwidawsk.net>	2012-06-27 09:58:35 -07:00
Ben Widawsky	971c080ac0	intel: wait render timeout implementation int drm_intel_gem_bo_wait(drm_intel_bo *bo, uint64_t timeout_ns) This should bump the libdrm version. We're waiting for context support so we can do both features in one bump. v2: don't return remaining timeout amount use get param and fallback for older kernels v3: only doing getparam at init prototypes now have a signed input value v4: update comments fall back to correct polling behavior with new userspace and old kernel v5: since the drmIoctl patch was not well received, return appropriate values in this function instead. As Daniel pointed out, the polling case (timeout == 0) should also return -ETIME. Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-06-27 09:00:58 -07:00
Ben Widawsky	ae137f4669	intel: Add IVB PUSH_CONSTANT decodes Signed-off-by: Ben Widawsky <ben@bwidawsk.net>	2012-06-24 18:11:48 -07:00
Paul Berry	da02f72bc2	intel: Add the ability to supply annotations for .aub files. This patch adds a new function, drm_intel_bufmgr_gem_set_aub_annotations(), which can be used to annotate the type and subtype of data stored in various sections of each buffer. This data is used to populate type and subtype fields when generating the .aub file, which improves the ability of later debugging tools to analyze the contents of the .aub file. If drm_intel_bufmgr_gem_set_aub_annotations() is not called, then we fall back to the old set of annotations (annotate the portion of the batchbuffer that is executed as AUB_TRACE_TYPE_BATCH, and everything else as AUB_TRACE_TYPE_NOTYPE). Reviewed-by: Eric Anholt <eric@anholt.net>	2012-05-10 11:32:39 -07:00
Daniel Vetter	4370425683	intel/decode: decode MI_WAIT_FOR_EVENT ... and add support to decode MI instructions with functions. Reviewed-by: Eugeni Dodonov <eugeni.dodonov@intel.com> Signed-Off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-04-02 15:57:36 +02:00
Eugeni Dodonov	e057a56448	intel: add Ivy Bridge GT2 server variant We were missing this one and it is being used by Bromolow. Signed-off-by: Eugeni Dodonov <eugeni.dodonov@intel.com>	2012-04-01 11:03:36 -03:00
Kenneth Graunke	617213357e	intel: Add some PCI IDs for Haswell. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eugeni Dodonov <eugeni.dodonov@intel.com>	2012-03-22 13:52:29 -07:00
Eric Anholt	5de5b7484a	intel: Quiet two more valgrind complaints with recent changes. These are more cases where valgrind doesn't understand what gets read or written by our ioctls.	2012-03-13 16:49:53 -07:00
Eric Anholt	9d18ad254a	intel: Add per-dword decode of gen7 3DPRIMITIVE.	2012-03-10 09:23:07 -08:00
Eric Anholt	9b87fd9a3d	intel: Move the gen4-6 3DPRIMITIVE handling out of the switch statement.	2012-03-10 09:23:07 -08:00
Eric Anholt	99c73378a1	intel: Add support for (possibly) unsynchronized maps. This improves the performance of Mesa's GL_MAP_UNSYNCHRONIZED_BIT path in GL_ARB_map_buffer_range. Improves Unigine Tropics performance at 1024x768 by 2.30482% +/- 0.0492146% (n=61) v2: Fix comment grammar. Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2012-03-10 09:21:33 -08:00
Eric Anholt	3a8884851b	intel: Fix error check for I915_PARAM_HAS_LLC. drmIoctl returns -1 on error with errno set to the error value. Other users of it in this file just check for != 0, and only use errno when they need to send an error value on to the caller of the API.	2012-03-09 17:21:00 -08:00
Eric Anholt	c9ce2edfc8	intel: Bump the copyright dates on the bufmgr files. We've been hacking these constantly.	2012-03-09 16:34:14 -08:00
Eric Anholt	4db16a9480	intel: Add .aub file output support. This will allow the driver to capture all of its execution state to a file for later debugging. intel_gpu_dump is limited in that it only captures batchbuffers, and Mesa's captures, while more complete, still capture only a portion of the state involved in execution. This is a squash commit of a long series of hacking as we tried to get the resulting traces to work in the internal simulator. It contains contributions by Yuanhan Liu and Kenneth Graunke. v2: Drop the MI_FLUSH_ENABLE setup. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eric Anholt <eric@anholt.net> Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2012-03-09 16:34:14 -08:00
Kenneth Graunke	6e642db7f4	intel: Add support for overriding the PCI ID via an environment variable For example: export INTEL_DEVID_OVERRIDE=0x162 If this variable is set, don't actually submit the batchbuffer to the GPU, it probably contains commands for the wrong generation of hardware. v2: Introduce a getter for the overridden devid, and avoid getenv per exec. Reviewed-by: Yuanhan Liu <yuanhan.liu@linux.intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Eric Anholt <eric@anholt.net>	2012-03-09 16:34:14 -08:00
Eric Anholt	783db34f6d	intel: Import a new batchbuffer for the gen7 test. This one doesn't have the 3DSTATE_HIER_DEPTH_BUFFER bug that the previous one did. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-02-22 12:27:34 -08:00
Eric Anholt	b395af0d2d	intel: Add decode for gen7 HIER_DEPTH_BUFFER. Note that the regression test complains here: The batch that was captured included a bug in its packet output, which was later fixed in Mesa. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-02-22 12:27:25 -08:00
Eric Anholt	e6beaf8ee4	intel: Add decode for gen7 3DSTATE_WM. This requires pulling the gen6 3DSTATE_WM out to a function so it doesn't override gen7's handler. v2: Fix pasteo in interpreting ZW interpolation (thanks danvet!). Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-02-22 12:26:45 -08:00
Eric Anholt	259e7b6138	intel: Fix a typo in decode error message. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-02-22 12:25:19 -08:00
Chris Wilson	23eeb7e1e4	intel: Detect cache domain inconsistency with valgrind Every access to either the GTT or CPU pointer is supposed to be proceeded by a set_domain ioctl so that GEM is able to manage the cache domains correctly and for the following access to be coherent. Of course, some people explicitly want incoherent, non-blocking access which is going to trigger warnings by this patch but are probably better served by explicit suppression. v2: Also mark the pointers as inaccessible following the explicit unmap and implicit unmap upon return to the cache. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-02-15 11:16:59 +00:00
Chris Wilson	90b23cc24c	intel: Mark up with valgrind intrinsics to reduce false positives In particular, declare the hidden CPU mmaps to valgrind so that it knows about those memory regions. v2: Add an additional VG_CLEAR for the getparam References: https://bugs.freedesktop.org/show_bug.cgi?id=35071 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Acked-by: Ben Widawsky <ben@bwidawsk.net> [anholt: Ideally valgrind should just learn about the ioctls, and removing the clear for the non-valgrindified code feels risky.] Reviewed-by: Eric Anholt <eric@anholt.net>	2012-02-11 11:45:39 +00:00
Eugeni Dodonov	151cdcfe68	intel: query for LLC support This adds support for querying the kernel about the LLC support in the hardware. In case the ioctl fails, we assume that it is present on GEN6 and GEN7. v2: fix the return code checking Signed-off-by: Eugeni Dodonov <eugeni.dodonov@intel.com>	2012-02-01 15:54:02 -02:00
Chad Versace	592ac67626	intel: Fix bufmgr_gem->gen for gen > 4 If the pci_device's actual gen was > 4, then we stupidly set bufmgr_gem->gen = 6. Luckily this caused no bugs, and this fix shouldn't change any behavior, because all checks against the gen currently have one of the forms below: gen == 2 gen == 3 gen >= 4 Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Eugeni Dodonov <eugeni.dodonov@intel.com> Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2012-01-30 13:03:35 -08:00
Eric Anholt	b643b0713a	intel: Add minimal decode for remaining gen7 packets in use. This just gets packet name and length in place, with the remainder unfinished. I've long since finished the work that got me started fixing up the decode.	2012-01-27 13:21:20 -08:00
Eric Anholt	54b12a085f	intel: Add decode for gen7 constant buffer packets.	2012-01-27 13:21:20 -08:00
Eric Anholt	938df6be48	intel: Add decode for gen7 state pointers. Since CC_STATE_POINTERS for gen6 and 7 are quite different but use the same opcode, move gen6 out to a helper function too, so we can use a helper function for gen7.	2012-01-27 13:21:20 -08:00
Eric Anholt	6a0b25e66b	intel: Add support for parsing gen7 URB packets.	2012-01-27 13:21:20 -08:00
Eric Anholt	ba8ce2da04	intel: Make most of the logic for 965 3d packet length checks table-driven. This puts the error message in a consistent location relative to the packet, and while I'm here I made the error message a bit more informative. Now, most static length packets need to just declare their length in the table and not worry.	2012-01-27 13:21:20 -08:00
Eric Anholt	b129e10af2	intel: Move the logic for getting 965 3d packet length to the packet table. While I'm touching every line of the table, sort it by opcode.	2012-01-27 13:21:20 -08:00
Eric Anholt	3dcb2d47ee	intel: Add support for parsing 965 3d packets using helper functions. I want to add packets, without contributing to the switch statement of doom.	2012-01-27 13:21:19 -08:00
Eric Anholt	5a1c10fe6a	intel: Parse the correct length for gen7 3DSTATE_MULTISAMPLE.	2012-01-27 13:21:19 -08:00
Eric Anholt	9695eee8a2	intel: Put the "gen" shorthand chipset identifier in the context. It's a lot nicer than using IS_WHATEVER(devid) all over the place, and we have this in our other projects too.	2012-01-27 13:21:19 -08:00
Eric Anholt	028715ee70	intel: Avoid the need for most overflow checks by using a scratch page. The overflow checks were all thoroughly untested, and a bunch of the ones I'm deleting were pretty broken. Now, in the case of overflow, you just decode data of 0xd0d0d0d0, and instr_out prints the warning message instead. Note that this still has the same issue of being under-tested, but at least it's one place instead of per-packet. A couple of BUFFER_FAIL uses are left where the length to be decoded could be (significantly) larger than a page, and the decode didn't just call instr_out (which doesn't dereference data itself unless it's safe).	2012-01-27 13:21:19 -08:00
Eric Anholt	c1d2946da8	intel: Make instr_out take the decode context. This reduces some of the extra derefs of the pointers.	2012-01-27 13:21:19 -08:00
Eric Anholt	b0371612f4	intel: Use the context to simplify BR01 decode. Similar to BR00, count was always 1 and was always an index, not a count.	2012-01-27 13:21:19 -08:00
Eric Anholt	62b410344c	intel: Use the context to simplify BR00 decode. The count (actually index) was always 0, because BR00 is dword 0.	2012-01-27 13:21:19 -08:00
Eric Anholt	de49fd41e2	intel: Plumb the context through the decode callchain. We still deref the context at the start of every call, but that will change next.	2012-01-27 13:21:19 -08:00
Eric Anholt	a756fa384f	intel: Drop the code for counting parsing failures. Nothing was consuming it. If something wants this in the future, would be done using the decode context anyway.	2012-01-27 13:21:19 -08:00
Eric Anholt	8fb66a7ded	intel: Track the current packet location in the decode context. This is the start of plumbing the context through the decode callchain instead of the current 4 arguments.	2012-01-27 13:21:19 -08:00
Eric Anholt	b5cb7f88de	intel: Add a regression test for 2D decode, which I'm about to refactor.	2012-01-27 13:21:19 -08:00
Eric Anholt	683855f655	intel: Add regression tests for batch decode. The .batch was generated using the dump-a-batch branch of git://people.freedesktop.org/~anholt/mesa using glxgears on gen7 hardware, using INTEL_DEVID_OVERRIDE for non-gen7 (this means that offsets in the buffers for non-gen7 are 0!). The .ref was generated by: ./test_decode tests/gen7-3d.batch -dump. The .sh exists because you can't supply arguments to tests using the simple automake tests driver. Something reasonable could be done using automake's parallel-tests driver (in fact, a previous version of the patch did that), but I was concerned that: 1) The parallel-tests driver is documented to be unstable -- they may change interfaces on us later. 2) The parallel-tests driver hides the output of tests in .log files scattered all over the tree, which was ugly and more painful to work with. v2: Actually add the batch files, add a .gitignore for the *-new.txt files added after failures, and fix failure mode for undetected chipset name. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> (v1)	2012-01-04 14:49:44 -08:00
Eric Anholt	ccbc40340b	intel: Add a regression test program for intel_decode.c. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-01-04 14:49:44 -08:00
Eric Anholt	ea33a231d5	intel: Add an interface for setting the output file for decode. Consumers often want to choose stdout vs stderr, and for testing I want to output to an open_memstream file. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2012-01-04 14:49:44 -08:00
Johannes Obermayr	a9dd34a7ee	intel/intel_decode.c: Remove #include "intel_decode.h". Signed-off-by: Chad Versace <chad.versace@linux.intel.com>	2011-12-30 21:07:55 -08:00
Eric Anholt	4149faf337	intel: Disable unused decode_logic_op(). It was producing an unused code warning. I'm tempted to just remove it, since it's unused, but I might use it soon. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:58 -08:00
Eric Anholt	88cffcc899	intel: Turn on normal warnings for intel_decode.c build. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:56 -08:00
Eric Anholt	7b48318806	intel: Remove c99ish variable declarations. I'd rather be able to use c99 variable declarations (there's a lot of awful code layout due to being c90ish), but I'll leave that for later. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:50 -08:00
Eric Anholt	0c46f020c8	intel: Fix printf format warnings for intel_decode. There was plenty of dropped useful data, and some horribly mis-formatted data. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:47 -08:00
Eric Anholt	39a06acfbd	intel: Add printflike warnings for instr_out. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:44 -08:00
Eric Anholt	1db55a80a9	intel: Fix a ton of signed vs unsigned and const char *warnings We've got a different (better) set of warning flags in place in this tree. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:42 -08:00
Eric Anholt	07768babb8	intel: Fix Wsigned-compare warnings (soon to be enabled). Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:39 -08:00
Eric Anholt	71066abe1e	intel: Get intel_decode.c minimally building. My plan is to use this drm_intel_dump_batchbuffer() interface for the current GPU tools, and the current Mesa batch dumping usage, while eventually building more interesting interfaces for other uses. Warnings are currently suppressed by using a helper lib with CFLAGS set manually, because the code is totally not ready for libdrm's warnings setup. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:36 -08:00
Eric Anholt	bbdda928d3	intel: Minor style tweaks after Lindent. Some comments weren't wrapped, and for some reason uint32_t data got an extra space (while other instances of "type identifier" didn't), and the indentation of the opcode-list structs got trashed. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:33 -08:00
Eric Anholt	193fa138d7	intel: Reformat intel_decode.c from intel-gpu-tools using Lindent. We generally go for kernel style in this tree, and this 4-space indent stuff was bothering me. The new results have some ugly bits, but they're in places where we desperately want to be using helper functions anyway. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:31 -08:00
Eric Anholt	1d318e2a79	intel: intel: Add IS_GEN[567] macros. These will be used by intel_decode.c, and were taken from intel-gpu-tools. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:29 -08:00
Eric Anholt	078bc5b6ee	intel: Make intel_chipset handle devid directly. This will make these macros reusable from intel_decode.c, which doesn't have a bufmgr_gem context, without faking the struct. We should generally only be using these macros from bufmgr_gem context setup anyway. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:43:27 -08:00
Eric Anholt	8c4a2c8848	intel: Import intel_decode.c from intel-gpu-tools. This is from commit `dd9a5b4f7f`. We've been sharing this file between that repo and Mesa, and it's time to build a real interface using it. I'm also hoping to apply some of its packet-walking logic for AUB dumping and batch validation purposes. Acked-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Eugeni Dodonov <eugeni@dodonov.net>	2011-12-29 16:42:57 -08:00
Chris Wilson	0ab2251b70	intel: Reset vma list upon purge During free we unconditionally delete the bo from the vma cache. This relies on the its list member being kept in a sane state. This fails after the object is purged, as the purge operation performs a pure deletion and doesn't reset the list member, leaving a pair of dangling pointers. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-14 08:23:02 +00:00
Chris Wilson	015286f03e	intel: Remove the fresh assertions used to debug the vma cacheing Hopefully all the bugs in the callers have been found, so time to handle the failures "gracefully" again. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-13 10:59:13 +00:00
Chris Wilson	c5f0ed1d29	intel: Update map-count for an early error return during mapping Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-13 10:41:33 +00:00
Chris Wilson	dd9a5b4f7f	intel: Evict cached VMA in order to make room for new mappings As the max number of VMA mappings is a hard per-process limit, we need to include the number of currently active mappings when evicting in order to make room for a new mmap. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-06 21:45:55 +00:00
Chris Wilson	e4b60f2960	intel: Add an interface to limit vma caching There is a per-process limit on the number of vma that the process can keep open, so we cannot keep an unlimited cache of unused vma's (besides keeping track of all those vma in the kernel adds considerable overhead). However, in order to work around inefficiencies in the kernel it is beneficial to reuse the vma, so keep a MRU cache of vma. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-05 22:22:10 +00:00
Chris Wilson	5c5332bbc3	intel: Clean up mmaps on freeing the buffer As a precautionary measure munmap on buffer free so that we never leak the vma. Also include a warning during debugging. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-05 10:43:13 +00:00
Chris Wilson	c549a777c1	intel: Unmap buffers during drm_intel_gem_bo_unmap We cannot afford to cache the vma per open bo as this may exhaust the per-process limits. References: https://bugs.freedesktop.org/show_bug.cgi?id=43075 References: https://bugs.freedesktop.org/show_bug.cgi?id=40066 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-05 10:25:53 +00:00
Daniel Vetter	36cff1cbb8	intel: limit aperture space to mappable area on gen3 Otherwise we blow up on heavy tiled blitter loads (with giant pixmaps). Signed-Off-by: Daniel Vetter <daniel.vetter@ffwll.ch> Acked-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-12-04 13:25:17 +01:00
Eric Anholt	d0ae6837d1	intel: Share the implementation of BO unmap between CPU and GTT mappings. Before this, consumers of the libdrm API that might map a buffer either way had to track which way was chosen at map time to call the appropriate unmap. This relaxes that requirement by making drm_intel_bo_unmap() always appropriate. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-10-29 12:04:09 -07:00
Eric Anholt	4cb01eeccf	intel: Don't call the SW_FINISH ioctl unless a CPU-mapped write was done. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch> Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-10-29 12:03:54 -07:00
Eric Anholt	77dc16f33d	intel: Remove stale comment. This used to be next to some map refcounting code, but that is long dead. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-10-29 12:03:34 -07:00
Eric Anholt	515cea6ac6	intel: Add an interface for removing relocs after they're added. This lets us replace the current inner drawing loop of mesa: for each prim { compute bo list if (check_aperture_space(bo list)) { batch_flush() compute bo list if (check_aperture_space(bo list)) { whine_about_batch_size() fall back; } } upload state to BOs } with this inner loop: for each prim { retry: upload state to BOs if (check_aperture_space(batch)) { if (!retried) { reset_to_last_prim() batch_flush() } else { if (batch_flush()) whine_about_batch_size() goto retry; } } } This avoids having to implement code to walk over certain sets of GL state twice (the "compute bo list" step). While it's not a performance improvement, it's a significant win in code complexity: about -200 lines, and one place to make mistakes related to aperture space instead of N places to forget some BO we should have included. Note how if we do a reset in the new loop , we immediately flush. We don't need to check aperture space -- the kernel will tell us if we actually ran out of aperture or not. And if we did run out of aperture, it's because either the single prim was too big, or because check_aperture was wrong at the point of setting up the last primitive. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-10-28 12:53:59 -07:00
Eric Anholt	2c2bdb36c5	intel: Use stdbool.h for dealing with boolean values. A few of the bitfield-based booleans are left in place. Changing them to "bool" results in the same code size, so I'm erring on the side of not changing things. Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-10-28 12:53:18 -07:00
Daniel Vetter	194aa1bee6	drm/i915: y tiling on i915G/i915GM is different Luckily the kernel has become extremely paranoid about such matters. Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-09-22 22:20:53 +02:00
Daniel Vetter	630dd26fb4	drm/intel: don't clobber bufmgr->pci_device Otherwise it's pretty hard to differentiate the different chipset variants. Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-09-22 22:20:09 +02:00
Ben Widawsky	2acaf160df	intel: shared header for shader debugging Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Cc: Chris Wilson <chris@chris-wilson.co.uk>	2011-08-01 14:44:58 -07:00
Yuanhan Liu	ce317a6d09	intel: fix the wrong method check for bo_get_subdata It's going to call bo_get_subdata method, but not bo_subdata Signed-off-by: Yuanhan Liu <yuanhan.liu@linux.intel.com>	2011-07-20 10:46:45 -07:00
Chris Wilson	9d77603d8b	intel: Add interface to query aperture sizes. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-06-04 13:01:11 +01:00
Chris Wilson	f395b0a25c	intel: Also handle mrb_exec fallback with ring == I915_EXEC_RENDER Reported-by: Oliver McFadden <oliver.mcfadden@nokia.com> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-04-04 08:59:39 +01:00
Daniel Vetter	06ebbf7c2c	Cleanup gen2 tiling confusion A tile on gen2 has a size of 2kb, stride of 128 bytes and 16 rows. Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-03-26 15:04:04 +01:00
Daniel Vetter	e6018c25ca	intel: Fixup for the fix for relaxed tiling on gen2 This is Fail. First patch to libdrm, and I've borked it up. Noticed-by: Chris Wilson <chris@chris-wilson.co.uk> Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-02-22 19:11:33 +01:00
Daniel Vetter	9a71ed93f4	intel: fix relaxed tiling on gen2 Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2011-02-22 18:53:56 +01:00
Chris Wilson	36d4939343	intel: Remember named bo ... and if asked to open a bo by the same global name, return a fresh reference to the previously allocated buffer. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-14 10:18:39 +00:00
Chris Wilson	53581b6210	intel: Set the public handle after opening by name Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-02-14 10:15:31 +00:00
Chris Wilson	550fe2ca3b	intel: compile fix for previous commit after rebasing Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-01-12 15:06:25 +00:00
Chris Wilson	6717b7579f	intel: Fallback to old exec if no mrb_exec is available Reported-by: Torsten Hilbrich <torsten.hilbrich@secunet.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=33016 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2011-01-12 11:00:13 +00:00
Chris Wilson	0184bb1c6d	intel: Export CONSTANT_BUFFER addressing mode Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-19 13:01:15 +00:00
Chris Wilson	537703fd48	intel: Reorder need_fence vs fenced_command to avoid fences on gen4 gen4+ hardware doesn't use fences for GPU access and the older kernel doesn't expect userspace to make such a mistake. So don't. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=32190 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-07 20:34:22 +00:00
Chris Wilson	af3d282afb	intel: If the command is fenced inform the kernel ... but only account for a fenced used if the object is tiled. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-12-03 10:51:49 +00:00
Chris Wilson	1443bea488	intel: Add a forward declaration of struct drm_clip_rect ... so that intel_bufmgr.h can be compiled standalone. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-25 16:59:20 +00:00
Chris Wilson	51b895041c	intel: Compute in-aperture size for relaxed fenced objects For relaxed fencing the object may only consume the small set of active pages, but still requires a fence region once bound into the aperture. This is the size we need to use when computing the maximum possible aperture space that could be used by a single batchbuffer and so avoid hitting ENOSPC. Reported-by: Daniel Vetter <daniel.vetter@ffwll.ch> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-22 09:54:47 +00:00
Eric Anholt	877b2ce15b	intel: Fix drm_intel_gem_bo_wait_rendering to wait for read-only usage too. Both the consumers of this API (sync objects and client throttling) were expecting this behavior. The kernel used to actually behave the desired (but incorrect) way for us anyway, but that got fixed a while back.	2010-11-09 13:57:19 -08:00
Albert Damen	49447a9b95	intel: initialize bufmgr.bo_mrb_exec unconditionally If bufmgr.bo_mrb_exec is not set, drm_intel_bo_mrb_exec returns ENODEV even though drm_intel_gem_bo_mrb_exec2 will work fine for the RENDER ring. Fixes xf86-video-intel after commit 'add BLT ring support' (5bed685f76) with kernels without BSD or BLT ring support (2.6.34 and before). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=31443 Signed-off-by: Albert Damen <albrt@gmx.net> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-11-07 14:59:55 +00:00
Eric Anholt	a52e61b5c8	intel: Drop silly asserts on mappings present at unmap time. The intent of these was to catch mismatched map/unmap. What it actually did was check whether there was ever a mapping of that type (including in a previous life of the buffer through the userland BO cache), not whether they were mismatched. We don't even actually want to catch mismatched map/unmap, unless we also do refcounting, since at one point Mesa would do map/map/use/unmap/unmap. Just remove this code instead.	2010-11-02 11:32:32 -07:00
Eric Anholt	4abb65f95c	intel: Remove gratuitous assert on bo_reference. This couldn't be triggered except by overflow, since there's an assert in unreference to catch the usual failure of over-unreferencing.	2010-11-02 11:19:21 -07:00
Eric Anholt	f45305c1aa	intel: Shove the fake bufmgr subdata implementation into the fake bufmgr.	2010-11-01 06:54:58 -07:00
Eric Anholt	6560b4766c	intel: Remove stale comment.	2010-11-01 06:50:04 -07:00
Chris Wilson	362457715f	intel: enable relaxed fence allocation for i915 The kernel has always allowed userspace to underallocate objects supplied for fencing. However, the kernel only allocated the object size for the fence in the GTT and so caused tiling corruption. More recently the kernel does allocate the full fence region in the GTT for an under-sized object and so advertises that clients may finally make use of this feature. The biggest benefit is for texture-heavy GL games on i945 such as World of Padman which go from needing over 1GiB of RAM to play to fitting in the GTT! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-29 10:49:54 +01:00
Chris Wilson	057fab3382	intel: Prepare for BLT ring split. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-26 11:41:40 +01:00
Chris Wilson	96214860bb	intel: Downgrade error warnings to debug As the higher layers check the error return from libdrm-intel and are supposed to handle the error (and print their own warning in extremis) the voluminous output on stderr is just noise and a hazard in its own right. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-10-01 16:42:38 +01:00
Chris Wilson	6299722c47	intel: Replace open-coded drmIoctl with calls to drmIoctl() Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-09-25 21:38:31 +01:00
Eric Anholt	23287f05cf	Avoid use of c++ reserved keyword "virtual" when using a C++ compiler. Avoids requiring nasty hacks around libdrm headers in the new C++ parts of Mesa drivers.	2010-08-26 15:45:12 -07:00
Chris Wilson	c3ddfea1a6	intel: Suppress the error return from setting domains after mapping. If the mapping succeeds we have a valid pointer. If setting the domain failures we may incur cache corruption. However the usual failure mode is because of a hung GPU, in which case it is preferable to ignore the minor error from setting the domain and continue on oblivious. If these errors persist, we should rate limit the warning [or even just remove it]. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-29 20:16:56 +01:00
Chris Wilson	726210f87d	intel: Limit tiled pitches to 8192 on pre-i965. Fixes: Bug 28515 - Failed to allocate framebuffer when exceed 2048 width https://bugs.freedesktop.org/show_bug.cgi?id=28515 Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-24 11:38:00 +01:00
Chris Wilson	6ea2bda5f5	intel: Only adjust the local stride used for SET_TILING in tiled alloc Mesa uses the returned pitch from alloc_tiled, so make sure that we set it correctly before modifying the stride used for the SET_TILING call. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 13:03:52 +01:00
Chris Wilson	aba3502190	intel: Restore SET_TILING for non-flinked bo. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 13:00:22 +01:00
Chris Wilson	c7bbaca6a3	intel: '===' != '==' Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 11:15:56 +01:00
Chris Wilson	cd34cbeb9f	intel: Sanitise strides for linear buffers and SET_TILING Ensure that the user doesn't attempt to specify a stride to use with a linear buffer by forcing such to be zero. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-22 11:09:12 +01:00
Chris Wilson	13e8270504	intel: Print out debugging message following ENOSPC execbuffer() returns ENOSPC if it cannot fit the batch buffer into the aperture which is the error we want to diagnose here. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	f16b4164d6	intel: Scan the cache for old bo once every second. Rearrange the cache cleanup so that we always scan following a final unreference, and guard against multiple scans in a single second. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	5eec286838	intel: Force stride to be 0 for I915_TILING_NONE. When allocating a tiled buffer, if we remove the desired tiling mode due to it being beyond hardware limits, also remove the stride. This ensures that we only ever use stride 0 with I915_TILING_NONE. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	1db22ff741	intel: Defer tiling change to allocation. As we now expose a method to allocate tiled buffers, it makes more sense to defer the SET_TILING until required. Besides the slim chance that it will be a no-op, by delaying the change we are less likely to stall on waiting for a bound buffer to release a fence register. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 22:37:18 +01:00
Chris Wilson	056aa9be04	intel: Track tiling stride We need to inform the kernel if the tiling stride changes and not only for changes of the tiling mode. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-21 14:25:39 +01:00
Eric Anholt	4f7704aea7	intel: Fix several other paths for buffers pointing at themselves.	2010-06-10 09:02:14 -07:00
Eric Anholt	0ec768e67a	intel: Add more intermediate sizes of cache buckets between powers of 2. We had two cases recently where the rounding to powers of two hurt badly: 4:2:0 YUV HD video frames would round up from 2.2MB to 4MB, and Urban Terror was hitting aperture size limitations. For UT, this is because mipmap trees for power of two texture sizes will land right in the middle between two cache buckets. By giving a few more sizes between powers of two, Urban Terror on my 945 ends up consuming 207MB of GEM objects instead of 272MB, and HD video decode on Ironlake goes from 99MB to 75MB. cairo-perf-diff of the benchmarks for gl and xlib shows a 1.09x and 1.06x speedup and a 1.07x, 1.08x, and 1.11x slowdown. From this, I think this patch was really a no-op in terms of performance for these CPU-bound workloads.	2010-06-10 08:56:56 -07:00
Chris Wilson	e65caeba9e	intel: Convert to untiled pitches if surface is too large for tiling. If the pitch is too large for the hardware to tile, recompute the required surface size based on the untiled pitch and alignments. For the older hardware, which has smaller limits and greater restrictions, this may be a considerable saving in allocation size. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-06-09 10:18:31 +01:00
Eric Anholt	f179137f8f	Allow a buffer to point at itself and still get relocs. I'm using this in experiments with the i965 Mesa driver.	2010-06-07 17:29:57 -07:00
Zou Nan hai	66375fd6e8	intel: Add support for kernel multi-ringbuffer API. This introduces a new API to exec on BSD ring buffer, for H.264 VLD decoding. Signed-off-by: Xiang Hai hao <haihao.xiang@intel.com> Signed-off-by: Zou Nan hai <nanhai.zou@intel.com>	2010-06-06 15:50:38 -07:00
Eric Anholt	58e54f62c9	intel_bufmgr_fake: fix compile warning.	2010-05-26 12:10:39 -07:00
Chris Wilson	fcf3e616ee	intel: Don't change tiling mode unless the kernel reports success. Fixes: Bug 26686 - Some textures are distorted with libdrm 2.4.18 in GTAVC&GTA3 http://bugs.freedesktop.org/show_bug.cgi?id=26686 This bug continues to haunt me. The kernel SET_TILING ioctl is inconsistent in its return values when reporting an error. If one of its sanity checks fail, then the input values are left unchanged. If the kernel later fails to change the tiling mode, then the input values are modified to match the current tiling on the object. In short, userspace cannot trust the return values upon error and so we must assume that upon error our current tiling mode matches reality and not update.	2010-05-24 18:38:29 +01:00
Chris Wilson	a3305b076c	Revert "intel: We don't need to take the bufmgr lock whilst mapping." This reverts commit `7ca558494d`. This was pushed ahead of an essential review of bo level locking in mesa, without which we cannot know whether removing this lock is safe.	2010-05-13 08:25:56 +01:00
Chris Wilson	07e7589d86	intel: query whether a buffer is reusable. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-05-11 08:54:06 +01:00
Chris Wilson	7ca558494d	intel: We don't need to take the bufmgr lock whilst mapping.	2010-05-06 22:56:41 +01:00
Chris Wilson	3506173ba7	intel: Use the correct size when allocating reloc_target_info array Thomas tracked down this error with kdm and commit b509640: ==4320== Invalid write of size 8 ==4320== at 0x9A97998: do_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0x9A97B9C: drm_intel_gem_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0xAED3234: intel_batchbuffer_emit_reloc (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF13827: brw_emit_vertices (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF1F14D: brw_upload_state (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF12122: brw_draw_prims (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xB256824: vbo_exec_vtx_flush (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0xB2523BB: vbo_exec_FlushVertices_internal (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0xB252411: vbo_exec_FlushVertices (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0xB195A3D: _mesa_PopAttrib (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0x8DF0F02: __glXDisp_Render (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x8DF517F: __glXDispatch (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== Address 0x126a8b80 is 0 bytes after a block of size 16,368 alloc'd ==4320== at 0x4C23E03: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==4320== by 0x9A97A64: do_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0x9A97B9C: drm_intel_gem_bo_emit_reloc (in /usr/lib/libdrm_intel.so.1.0.0) ==4320== by 0xAED3234: intel_batchbuffer_emit_reloc (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF191DB: upload_binding_table_pointers (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF1F14D: brw_upload_state (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xAF12122: brw_draw_prims (in /usr/lib/xorg/modules/dri/i965_dri.so) ==4320== by 0xB255EF6: vbo_exec_DrawArrays (in /usr/lib/xorg/modules/dri/libdricore.so) ==4320== by 0x8DF67A3: __glXDisp_DrawArrays (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x8DF0F02: __glXDisp_Render (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x8DF517F: __glXDispatch (in /usr/lib/xorg/modules/extensions/libglx.xorg) ==4320== by 0x446293: ??? (in /usr/bin/Xorg) which is simply due to only allocating space for the pointers and not the structs themselves. D'oh. Reported-by: Thomas Bächler <thomas@archlinux.org> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-04-11 18:48:31 +01:00
Eric Anholt	ef36c9a3b2	intel: Install the header file in the libdrm/ directory. Suggested-by: Rémi Cardona <remi@gentoo.org> Signed-off-by: Eric Anholt <eric@anholt.net>	2010-03-17 12:49:10 -07:00
Julien Cristau	d271336925	libdrm_intel.pc: don't include ${includedir}/drm intel_bufmgr.h is installed in ${includedir} directly, and the other headers are taken care of by libdrm.pc's Cflags. Signed-off-by: Julien Cristau <jcristau@debian.org> Signed-off-by: Eric Anholt <eric@anholt.net>	2010-03-17 12:45:23 -07:00
Eric Anholt	7c697b1670	intel: Align untiled buffer pitch to 64B. This is the largest untiled pitch requirement from gen2 through gen4. It's only the case for gen3 rendering to color regions with depth, but it's rare for this to be a significant factor in memory usage -- for example, gen4 requires 1 or 2 times the element size, or up to 64 bytes depending on the size of the elements. This is easier than encoding all the various little quirks for untiled pitch alignment, since we rarely do untiled now.	2010-03-17 11:15:45 -07:00
Pauli Nieminen	21105bc186	libdrm: Move intel_atomic.h to libdrm core for sharing. intel_atomic.h includes very usefull atomic operations for lock free parrallel access of variables. Moving these to core libdrm for code sharing with radeon. Signed-off-by: Pauli Nieminen <suokkos@gmail.com>	2010-03-17 11:48:00 +02:00
Chris Wilson	a4041e096c	intel: Repeat execbuffer if interrupted by signal Repeat while EINTR, not EAGAIN! One more source of corruption erradicated, hurray! Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-03-07 14:17:24 +00:00
Eric Anholt	1d4d1e6b13	intel: Only align Y-tiling pitch to the Y tile width. Fixes piglit depth-tex-modes on gen4.	2010-03-04 16:27:45 -08:00
Chris Wilson	3e21e3ba00	intel: Propagate some more error returns Ensure that errors from the kernel are propagated back to the caller, and not masked with return 0; Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-03-04 21:17:48 +00:00
Eric Anholt	47102863ed	intel: Update the needs_fence flag of buffers on the validate list. Fixes fbo-copyteximage on i915 with texture tiling and execbuf2 fenced relocs.	2010-03-03 10:08:44 -08:00
Eric Anholt	766fa7909f	intel: Don't enable execbuf2 fenced relocs unless we have execbuf2.	2010-03-02 16:04:14 -08:00
Eric Anholt	292a5d73a2	intel: Don't tile-align pitch for untiled buffers. This allows Mesa to use drm_intel_bo_alloc_tiled() for its tiled buffers, since it makes its decision about pitch before telling libdrm. They happen to be the same choices for the tiled case.	2010-03-02 15:39:37 -08:00
Eric Anholt	acbaff24e0	intel: Fix typo in conversion from IS_GEN to bufmgr_gem->gen. Luckily I caught the bug with the first consumer of the interface.	2010-03-02 15:39:37 -08:00
Eric Anholt	a0abb1b14e	intel: add a comment about tiled buffer alloc height alignment from Mesa.	2010-03-02 15:39:37 -08:00
Eric Anholt	a1f9ea765d	intel: Use an integer for chipset generation instead of many conditionals. Saves a bunch of comparisons in hot paths.	2010-03-02 10:12:46 -08:00
Jesse Barnes	b50964027b	libdrm/intel: execbuf2 support This patch to libdrm adds support for the new execbuf2 ioctl. If detected, it will be used instead of the old ioctl. By using the new drm_intel_bufmgr_gem_enable_fenced_relocs(), you can indicate that any time a fence register is actually required for a relocation target you will call drm_intel_bo_emit_reloc_fence instead of drm_intel_bo_emit_reloc, which will reduce fence register pressure. Signed-off-by: Eric Anholt <eric@anholt.net>	2010-03-02 10:10:50 -08:00
Eric Anholt	f6dc964e1d	intel: Add initial support for Sandybridge, and clean up the #defines.	2010-02-25 10:41:03 -08:00
Chris Wilson	4f0f871730	intel: Handle resetting of input params after EINTR during SET_TILING The SET_TILING is pernicious in that it overwrites the input arguments following an error in order to report the current tiling state of the buffer. This caught us by surprise as we then fed those arguments back into to the ioctl unmodified following an EINTR and so the kernel then reported success for the no-op. We interpreted this success as meaning that the tiling on the buffer had changed so updated our state and started using the buffer incorrectly in the new tiled/untiled manner. This lead to all sorts of random corruption and GPU hangs, even though the batch buffers would look sane (when the GPU had not wandered off into forbidden territory). References: Bug 25475 - [i915] Xorg crash / Execbuf while wedged http://bugs.freedesktop.org/show_bug.cgi?id=25475 Bug 25554 - i830_uxa_prepare_access: gtt bo map failed: Input/output error http://bugs.freedesktop.org/show_bug.cgi?id=25554 (And probably every other weird bug in the last few months.) Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-02-10 09:52:36 +00:00
Chris Wilson	fdcde592c2	intel: Account for potential pinned buffers hogging fences As the kernel reports the total number of fences, we must guess how many fences are likely to be pinned. In the typical system these will be only used by the scanout buffers, of which there may be one per pipe, and any number of manually pinned fenced buffers. So take a conservative guess and reserve two fences for use by the system. Note this reduces the number of fences to 3 for i915 and prior. Reference: http://bugs.freedesktop.org/show_bug.cgi?id=25911 The latest intel driver 2.10.0 causes kernel oops and system hangs Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>	2010-02-09 08:37:37 +00:00
Dave Airlie	973d8d6bd0	intel: check return value for calloc	2010-02-02 10:57:12 +10:00

1 2 3 4 5 ...

267 Commits (99722bfec6f1bbfc5148701f85ac145140d46afb)