Haskell使用动态编程的性能 [英] Haskell performance using dynamic programming

查看：145 发布时间：2018/6/5 11:26:25 performance haskell dynamic-programming

本文介绍了Haskell使用动态编程的性能的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我试图用动态编程来计算两个字符串之间的Levenshtein距离。这是通过Hackerrank完成的，所以我有时间限制。我使用了一个我见过的技术：动态编程算法是如何实现的在惯用的Haskell？中，它似乎在工作。不幸的是，它在一个测试案例中超时。我无法访问特定的测试用例，因此我不知道输入的确切大小。

I am attempting to calculate the Levenshtein distance between two strings using dynamic programming. This is being done through Hackerrank, so I have timing constraints. I used a techenique I saw in: How are Dynamic Programming algorithms implemented in idiomatic Haskell? and it seems to be working. Unfortunaly, it is timing out in one test case. I do not have access to the specific test case, so I don't know the exact size of the input.

import Control.Monad
import Data.Array.IArray
import Data.Array.Unboxed

main = do
  n <- readLn
  replicateM_ n $ do
    s1 <- getLine
    s2 <- getLine
    print $ editDistance s1 s2

editDistance :: String -> String -> Int
editDistance s1 s2 = dynamic editDistance' (length s1, length s2)
  where
    s1' :: UArray Int Char
    s1' = listArray (1,length s1) s1
    s2' :: UArray Int Char
    s2' = listArray (1,length s2) s2
    editDistance' table (i,j)
      | min i j == 0 = max i j
      | otherwise = min' (table!((i-1),j) + 1) (table!(i,(j-1)) + 1) (table!((i-1),(j-1)) + cost)
      where
        cost =  if s1'!i == s2'!j then 0 else 1
        min' a b = min (min a b)

dynamic :: (Array (Int,Int) Int -> (Int,Int) -> Int) -> (Int,Int) -> Int
dynamic compute (xBnd, yBnd) = table!(xBnd,yBnd)
  where
    table = newTable $ map (\coord -> (coord, compute table coord)) [(x,y) | x<-[0..xBnd], y<-[0..yBnd]]
    newTable xs = array ((0,0),fst (last xs)) xs

我已经切换到使用数组，但是速度不够。我无法使用Unboxed数组，因为此代码依赖于懒惰。我有没有明显的表现错误？或者我还能如何加快速度？

I've switched to using arrays, but that speed up was insufficient. I cannot use Unboxed arrays, because this code relies on laziness. Are there any glaring performance mistakes I have made? Or how else can I speed it up?

推荐答案

编辑距离计算的反向方程为：

The backward equations for edit distance calculations are:

f(i, j) = minimum [
  1 + f(i + 1, j), -- delete from the 1st string
  1 + f(i, j + 1), -- delete from the 2nd string 
  f(i + 1, j + 1) + if a(i) == b(j) then 0 else 1 -- substitute or match
]

因此，在每个维度中，您只需要下一个索引： + 1 。这是一个顺序访问模式，而不是对数组的随机访问;并可以使用列表和嵌套右褶皱实现：

So within each dimension, you need nothing more than the very next index: + 1. This is a sequential access pattern, not random access to require arrays; and can be implemented using lists and nested right folds:

editDistance :: Eq a => [a] -> [a] -> Int editDistance a b = head . foldr loop [n, n - 1..0] $ zip a [m, m - 1..] where (m, n) = (length a, length b) loop (s, l) lst = foldr go [l] $ zip3 b lst (tail lst) where go (t, i, j) acc@(k:_) = inc `seq` inc:acc where inc = minimum [i + 1, k + 1, if s == t then j else j + 1]

您可以在 Hackerrank编辑距离问题，如下所示：

You may test this code in Hackerrank Edit Distance Problem as in:

import Control.Applicative ((<$>)) import Control.Monad (replicateM_) import Text.Read (readMaybe) editDistance :: Eq a => [a] -> [a] -> Int editDistance a b = ... -- as implemented above main :: IO () main = do Just n <- readMaybe <$> getLine replicateM_ n $ do a <- getLine b <- getLine print $ editDistance a b

，通过所有测试，表现不俗。

which passes all tests with a decent performance.

这篇关于Haskell使用动态编程的性能的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

Haskell使用动态编程的性能 [英] Haskell performance using dynamic programming

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Haskell使用动态编程的性能 [英] Haskell performance using dynamic programming

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭