summaryrefslogtreecommitdiffstats
path: root/video/out/opengl/nnedi3.h
diff options
context:
space:
mode:
authorBin Jin <bjin1990@gmail.com>2015-10-28 01:37:55 +0000
committerwm4 <wm4@nowhere>2015-11-05 17:38:20 +0100
commit27dc834f37cd2427798c8cb582a574409865d1e7 (patch)
treefcc4fdfb0a4c8b20958ee110d5d8068439779848 /video/out/opengl/nnedi3.h
parent3f73d6352306d470821f3ea5078b7b7f8031f0d7 (diff)
downloadmpv-27dc834f37cd2427798c8cb582a574409865d1e7.tar.bz2
mpv-27dc834f37cd2427798c8cb582a574409865d1e7.tar.xz
vo_opengl: implement NNEDI3 prescaler
Implement NNEDI3, a neural network based deinterlacer. The shader is reimplemented in GLSL and supports both 8x4 and 8x6 sampling window now. This allows the shader to be licensed under LGPL2.1 so that it can be used in mpv. The current implementation supports uploading the NN weights (up to 51kb with placebo setting) in two different way, via uniform buffer object or hard coding into shader source. UBO requires OpenGL 3.1, which only guarantee 16kb per block. But I find that 64kb seems to be a default setting for recent card/driver (which nnedi3 is targeting), so I think we're fine here (with default nnedi3 setting the size of weights is 9kb). Hard-coding into shader requires OpenGL 3.3, for the "intBitsToFloat()" built-in function. This is necessary to precisely represent these weights in GLSL. I tried several human readable floating point number format (with really high precision as for single precision float), but for some reason they are not working nicely, bad pixels (with NaN value) could be produced with some weights set. We could also add support to upload these weights with texture, just for compatibility reason (etc. upscaling a still image with a low end graphics card). But as I tested, it's rather slow even with 1D texture (we probably had to use 2D texture due to dimension size limitation). Since there is always better choice to do NNEDI3 upscaling for still image (vapoursynth plugin), it's not implemented in this commit. If this turns out to be a popular demand from the user, it should be easy to add it later. For those who wants to optimize the performance a bit further, the bottleneck seems to be: 1. overhead to upload and access these weights, (in particular, the shader code will be regenerated for each frame, it's on CPU though). 2. "dot()" performance in the main loop. 3. "exp()" performance in the main loop, there are various fast implementation with some bit tricks (probably with the help of the intBitsToFloat function). The code is tested with nvidia card and driver (355.11), on Linux. Closes #2230
Diffstat (limited to 'video/out/opengl/nnedi3.h')
-rw-r--r--video/out/opengl/nnedi3.h47
1 files changed, 47 insertions, 0 deletions
diff --git a/video/out/opengl/nnedi3.h b/video/out/opengl/nnedi3.h
new file mode 100644
index 0000000000..ae0104ef04
--- /dev/null
+++ b/video/out/opengl/nnedi3.h
@@ -0,0 +1,47 @@
+/*
+ * This file is part of mpv.
+ *
+ * mpv is free software; you can redistribute it and/or modify
+ * it under the terms of the GNU General Public License as published by
+ * the Free Software Foundation; either version 2 of the License, or
+ * (at your option) any later version.
+ *
+ * mpv is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
+ * GNU General Public License for more details.
+ *
+ * You should have received a copy of the GNU General Public License along
+ * with mpv. If not, see <http://www.gnu.org/licenses/>.
+ *
+ * You can alternatively redistribute this file and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ */
+
+#ifndef MP_GL_NNEDI3_H
+#define MP_GL_NNEDI3_H
+
+#include "common.h"
+#include "utils.h"
+
+#define NNEDI3_UPLOAD_UBO 0
+#define NNEDI3_UPLOAD_SHADER 1
+
+struct nnedi3_opts {
+ int neurons;
+ int window;
+ int upload;
+};
+
+extern const struct nnedi3_opts nnedi3_opts_def;
+extern const struct m_sub_options nnedi3_conf;
+
+const float* get_nnedi3_weights(const struct nnedi3_opts *conf, int *size);
+
+void pass_nnedi3(struct gl_shader_cache *sc, int planes, int tex_num,
+ int step, const struct nnedi3_opts *conf,
+ struct gl_transform *transform);
+
+#endif